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Abstract. There is a constant Co such that all nonabelian finite simple 
groups of rank n over W q , with the possible exception of the Ree groups 
2 G2(3 2e+1 ), have presentations with at most Co generators and relations and 
total length at most Co (log n + logg). As a corollary, we deduce a conjecture 
of Holt: there is a constant C such that dim H 2 (G,M) < CdimM for every 
finite simple group G, every prime p and every irreducible F p G-module M. 
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1. Introduction 

In this paper we study presentations of finite simple groups G from a quantita- 
tive point of view. Our main result provides unexpected answers to the following 
questions: how many relations are needed to define G, and how short can these 
relations be? 

The classification of the finite simple groups states that every nonabelian finite 
simple group is alternating, Lie type, or one of 26 sporadic groups. The latter 
are of no relevance to our asymptotic results. Instead, we will primarily deal with 
groups of Lie type, which have a (relative) rank n over a field ¥ q . In order to keep 
our results uniform, we view the alternating group A n and symmetric group S n as 
groups of rank n — 1 over "the field Fi with 1 element" [Til]. With this in mind, 
we will prove the following 

Theorem A. All nonabelian finite simple groups of rank n over ¥ q , with the 
possible exception of the Ree groups 2 G 2 (?), have presentations with at most Co 
generators and relations and total length at most Co (log n + logg), for a constant 
Co independent of n and q. 1 

We estimate (very crudely) that Co < 1000; this reflects the explicit and con- 
structive nature of our presentations. The theorem also holds for all perfect central 
extensions of the stated groups. 

The theorem is interesting in several ways. It already seems quite surpris- 
ing that the alternating and symmetric groups have bounded presentations, i.e., 
with a bounded number of generators and relations independent of the size of the 
group, as in the theorem (this possibility was recently inquired about in [CHRR, 
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p. 281]). This was even less expected for the groups of Lie type such as PSL(n, q). 
Mann [Man] conjectured that every finite simple group G has a presentation with 
0(log|G|) relations (this is discussed at length in [LS, Sec. 2.3]). As a simple 
application of [BGKLP], this was proved in [Man] with the possible exception of the 
twisted rank 1 groups (the Suzuki groups 2 B 2 (q) = Sz(q), Ree groups 2 G2(q) = R(q) 
and unitary groups 2 A 2 (q) = PSU(3,g); in fact, by [Suz, HS] Mann's result also 
holds for Sz(q) and PSU(3,g)). Theorem A clearly goes much further than this 
result, providing an absolute bound on the number of relations. 

The length of a presentation is the sum of the number of generators and the 
lengths of the relations as words in both the generators and their inverses. A presen- 
tation for a group G in the theorem will be called short if its length is 0(log n+log q) . 
(A significantly different definition of "short" , used in [BGKLP] , involves a bound 
0((log |G|) C ), i.e., 0((n 2 \ogq) c ) ifq> 1.) When q > 1, the standard Steinberg pre- 
sentation [Stl, St2] for a group in the theorem has length 0(n 4 q 2 ) (with a slightly 
different bound for some of the twisted groups) . In [BGKLP] (when combined with 
[Suz, HS] for the cases Sz(g) and PSU(3, q)), the Curtis-Steinberg-Tits presentation 
[Cur], [Ti2, Theorem 13.32] was used to prove that, once again with the possible 
exception of the groups 2 G 2 (q), all finite simple groups G have presentations of 
length 0((log |G|) 2 ) (i.e., 0((n 2 log*?) 2 ) if q > 1) - in fact of length 0(log \G\) for 
most families of simple groups. Theorem A clearly also improves this substantially. 
Note that our 0(log(nq)) bound on length is optimal in terms of n and q, since there 
are at least enq /log q groups of rank at most n over fields of order at most q (see 
Section 8). On the other hand, whereas our theorem shows that nonabelian finite 
simple groups have presentations far shorter than 0(log |G|), [BGKLP] (combined 
with [Suz, HS]) showed that every finite group G with no 2 G2{q) composition factor 
has a presentation of length 0((log |G|) 3 ), where the constant 3 is best possible. 

Theorem A seems counterintuitive, in view of the standard types of presentations 
of simple groups. We have already mentioned such presentations for groups of Lie 
type, but symmetric and alternating groups are far more familiar. The most well- 
known presentation for S n is the "Coxeter presentation" (2.5) (discovered by Moore 
[Mo] ten years before Coxeter's birth). It uses roughly n generators and n 2 relations, 
and has length roughly n 2 ; others use 0(n) relations [Mo, Cox]. We know of no 
substantially shorter presentations in print. However, independent of this paper, 
[BCLO] obtained a presentation of S n that is bounded and has bit-length O(logn); 
however, it is not short in the sense given above. (Section 1.2 contains a definition of 
bit-length, together with a discussion of various notions of lengths of presentations 
and their relationships. For example, it is straightforward to turn a presentation 
having bit-length N into a presentation having length AN, but generally at the cost 
of introducing an unbounded number of additional relations.) 

Either bounded or short presentations for nonabelian simple groups go sub- 
stantially beyond what one might expect. Obtaining both simultaneously was a 
surprise. By contrast, while abelian simple groups (i.e., cyclic groups of prime 
order) have bounded presentations, as well as ones that are short, they cannot have 
presentations satisfying both of these conditions (cf. Remark 1.2). In fact, this 
example led us to believe, initially, that nonabelian simple groups also would not 
have short bounded presentations. A hint that nonabelian finite simple groups be- 
have differently from abelian ones came from the Congruence Subgroup Property, 
which can be used to obtain a short bounded presentation for SL(2,p) when p is 
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prime (see Section 3.1). This approach was later replaced by an elementary but 
somewhat detailed one (see Section 3.2), which became increasingly more intricate 
as we moved to PSL(2, q) and other rank 1 groups (Sections 4.1-4.4). The PSL(2,p) 
presentation also was used to obtain a bounded and short presentation for S n (Sec- 
tion 3.4); when combined with presentations for rank 1 groups the latter was then 
used to prove the theorem for all groups of large rank (Section 6). Our arguments 
could have been considerably simplified if we had been willing to give up one of the 
two features of the presentations in Theorem A. A modified version of Theorem A 
is given in [GKKL2] that uses significantly fewer relations in presentations that are 
no longer short. 

In order to emphasize our special interest in bounded presentations, we note 
the following consequence of the above theorem, which follows immediately from 
Lemma 2.1 together with the fact that, by the classification of the finite simple 
groups, every such group can be generated by two elements: 

Corollary A'. There is a constant C such that every finite simple group, with the 
possible exception of the Ree groups 2 G2(q), has a presentation having 2 generators 
and at most C relations. 

In [GKKL2] we show that C < 100 by ignoring presentation length. The groups 
2 G2(q) are obstacles for all of these results: no short or bounded presentation is 
presently known for them. We can handle these offending exceptions by changing 
our focus either to cohomology or to profinite presentations. We say that a finite or 
profinite group G has a profinite presentation with d generators X and r relations 
Y if there exists a continuous epimorphism Fd — ► G, where Fd is the free profinite 
group on a set X of d generators, whose kernel is the minimal closed normal sub- 
group of Fd containing the r-element subset Y. Then in the category of profinite 
presentations there are no exceptions: 

Theorem B. There is a constant C such that every finite simple group G has a 
profinite presentation with 2 generators and at most C relations. 

The quantitative study of profinite presentations of finite simple groups was 
started in [Lubl], motivated by an attempt to prove the Mann-Pyber conjecture 
[Man, Py], which asserts that the number of normal subgroups of index n of the 
free group Fd is O(n cdlog ") for some constant c. Mann [Man] showed that his 
conjecture about 0(log |G|)-relation presentations for finite simple groups implies 
the Mann-Pyber conjecture; and hence he proved that conjecture except for the 
twisted rank 1 groups. 

In [Lubl] the Mann-Pyber conjecture was proved by using a weaker version of 
Mann's conjecture: There is a constant C such that every finite simple group G 
has a profinite presentation with 2 generators and at most Clog|G| relations. The 
crucial ingredient in the proof of the latter result was a theorem of Holt [Ho] : There 
is a constant C such that, for every finite simple group G, every prime p and every 
irreducible ¥ p G-module M, 

(1.1) dim H 2 (G,M) < G(log p |G| p )dimM, 

where |G| P denotes the p-part of the integer |G|. In fact, Holt proved that G can 
be taken to be 2, using tools including standard cohomology methods together 
with results in [AG]. Moreover, Holt [Ho] conjectured the following stronger result, 
which we will see is equivalent to Theorem B: 
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Theorem B' (Holt's Conjecture for simple groups). There is a constant C such 
that, for every finite simple group G, every prime p and every irreducible F P G- 
module M, dim H 2 {G,M) < C dim M. 

In a subsequent paper [GKKL1] we will show that the constant in Theorems B 
and B' can be taken to be less than 20 (and is probably even smaller than this). 
The proof uses an interplay of cohomological and profinite presentation results. 

More generally Holt proved that (1.1) holds for any finite group G and any 
faithful irreducible F p G-module M; his proof reduced this to the simple group 
case. Similarly, using a different reduction, in [GKKL1] we use Theorem B' to 
prove a stronger conjecture made by Holt: 

Theorem B" (Holt's Conjecture). There is a constant C such that, for every finite 
group G, every prime p and every faithful irreducible ¥ p G-module M, dim H 2 (G, M) 
< GdimM. 

As noted above, Theorem A does not require the classification of the finite sim- 
ple groups: it only deals with groups having rank n over ¥ q . Of course, by the 
classification this ignores only the 26 sporadic simple groups. On the other hand, 
Theorems B, B' and B" do use the classification, as does Holt's proof of (1.1). 
However, unlike many papers by some of the authors, no classification-dependent 
internal structural properties of these groups are required for the proof. 

We emphasize that our methods are all essentially elementary: the needed struc- 
tural properties of the groups are standard, and the only number theory used is 
"Bertrand's Postulate" , a weak precursor to the Prime Number Theorem. Finally, 
we note that most of our presentations contain standard structural aspects of vari- 
ous simple groups: we do not eliminate or otherwise fine-tune relations in a manner 
standard in this subject (there are many examples of the latter process in [CoMo]). 

1.1. Outline of the proofs. In Section 7 we will show that Theorems B and B' 
are equivalent, and that they follow from Corollary A' except for the Ree groups 
2 G2(q)- In order to get a smaller constant in Theorem B, a detailed estimation of 
second cohomology groups is needed using (7.3). This is postponed to a later paper 
[GKKL1] . 

Sections 3-6 contain the proof of Theorem A. Groups of Lie type are built out 
of symmetric groups (and related Weyl groups) together with rank 1 groups of Lie 
type. Most of our efforts are directed towards these two classes of groups. 

In Section 3 we obtain bounded presentations of length O(logn) for symmetric 
and alternating groups. Our approach is based on the groups PSL(2,p). Therefore 
we first have to handle that case in Sections 3.1 and 3.2. A short bounded pre- 
sentation for PSL(2,p) already requires having the presentation encode exponents 
up to p in binary; this is accomplished using a trick from [BKL]: a group-theoretic 
version of "Horner's Rule" (3.3). 

We combine our presentation for PSL(2, p) with the usual embedding PSL(2, p) < 
Sp+i in order to obtain a short bounded presentation for S p +2 whenever p > 3 is 
prime. Then we glue together such presentations for two copies of S p +2 m order to 
obtain a short bounded presentation for the general symmetric group S n . 

We deal with rank 1 groups (especially SL(2, q), SU(3, q) and Sz(g)) in Section 4. 
For these groups we have to deal with several obstacles: 

• Borel subgroups do not have short bounded presentations. However, we need 
to use a standard presentation of Steinberg [St2, Sec. 4], based on rank 1 
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BN-pairs, that is built from a presentation for a Borcl subgroup B together 
with other data (cf. Theorem 4.34). (For example, in the case SL(2,g) 
the subgroup B is a semidirect product F+xF* and hence does not have a 
short bounded presentation; cf. Remark 1.2.) 

As a substitute for a short bounded presentation for B, we find one 
for an infinite central extension of B, after which almost all of the center 
is killed when we include the remaining ingredients in a presentation for 
groups such as SL(2, q). 

• Somewhat long relations, such as x p = 1, need to be handled, once again 
using Horner's Rule. 

• Elementary abelian subgroups of order q and arbitrary rank have to be 
shown to be abelian by using only a bounded number of relations. For this 
purpose we need to significantly generalize an idea due to Baumslag [Bau] 
and Campbell, Robertson and Williams [CRW1] so that we can even handle 
nonabelian subgroups of unitary and Suzuki groups. 

• Longer relations, such as a^ 9-1 )/ 2 = 1, have to be handled. This is accom- 
plished primarily by using the minimal polynomial of a field element of the 
required order as a replacement for the exponent. 

Thus, already in rank 1 we have to put significant effort into handling the restric- 
tions imposed in Theorem A. 

The rank 1 presentations, combined with the Curtis-Steinberg-Tits presenta- 
tions [Cur], [Ti2, Theorem 13.32], are used in Section 5 to obtain short bounded 
presentations for bounded rank groups. Bounded (but not short) presentations for 
untwisted groups of bounded rank were obtained in [KoLu] . Starting with our rank 
1 results, all bounded rank groups can be readily handled somewhat as in [KoLu] 
(and in fact this is essentially in [BGKLP]). Fortunately, the rank 1 group 2 G2(q) 
docs not arise as a Levi subgroup of any higher rank group. 

Finally, in Section 6 we first consider SL(n,q), and then show how to extend 
the results from both SL(n,q) and bounded rank to the universal covers of all 
groups of Lie type (except 2 G2(q)), using the Curtis-Steinberg-Tits presentation. 
For all groups of large rank we go to some additional effort to increase our sets 
of generators and relations somewhat in order to obtain a presentation in which 
a suitable element of the center is a short word in the generators, and hence the 
center can be factored out within the constraints of Theorem A.. 

1.2. The lengths to which we could go. Table 1 contains a summary of the 
preceding outline, together with information concerning different notions of lengths 
of a presentation (X \ R) : 

length = word length: \X\ + sum of the lengths of the words in R within the 
free group on X; thus, length refers to strings in the alphabet XUX -1 . 
This is the notion of length used in this paper. Mild variations, the 
same as ours up to a constant, are used by practitioners of the art 
of computing explicit presentations for groups. For example, [CHRR] 
just uses the above length-sum, while [Do] uses \R\ + the length-sum. 
As noted in [Sim, pp. 290-291]: "There is no universal agreement as to 
when one presentation is simpler than another. Usually it is considered 
good if the number of generators can be reduced or the total length of 
the relators can be made smaller.... The real challenge is to reduce both 
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the number of generators and the total length of the presentation." A 
similar notion appears in the discussion of length in [HEO, p. 184]: 
"As a measure of the complexity of a group presentation, we formally 
define its size to be the triple (\X\, \R\, I), where I is the total relator 
length." 

plcngth = power length: \X\ + total number of powers appearing in R. 

blength = bit-length: total number of bits required to write the presentation, 
used in [BGKLP] and [BCLO]. In particular, all exponents are encoded 
as binary strings, the sum of whose lengths enters into the blength, 
as does the space required to enumerate the list of generators and 
relations. This definition seems especially suitable from the point of 
view of Computer Science. It also makes results similar to those given 
here significantly easier to prove (see [GKKL2]). 

Some of our presentations can be modified so as to have these other lengths, as 
indicated in Table 1, where we write q = p e with p prime. However, we definitely do 
not provide presentations having all of the indicated types of lengths simultaneously. 

Example 1: (x \ x 11 ) has length 1 + n, plength 2, and blength 2 + [logn]. 

Example 2: If k — ^™<Zj2 l with <E {0,1}, the power x k can be replaced 

by n™^ 2 ) ai — ritT^r f° r new generators Xi and relations x — x ,Xi = x\_ x 
(1 < i < m). 

This multiplies bit-length by at most a factor of 4 in order to obtain length. Thus, 
we see that length and bit-length are essentially the same, except if presentations 
are to be bounded, as in this paper. For bounded presentations, bit-length can be 
far smaller than length. Clearly, plength is always at most length. We also note 
that the computer algebra system Magma stores relations as written: it docs not 
introduce new relations (as in this example) unless asked to do so. 

This example shows that a cyclic group of order n has a presentation of length 
O(logn). However, there is no such short presentation with a bounded number of 
relations: 

Remark 1.2. Each family of cyclic groups C n whose orders are unbounded can- 
not have bounded presentations of length O(logn). To see this, assume that C„ 
has a presentation with a bounded number of generators xi,. . . ,Xd and relations 
wi, . . . ,w r . Include the bounded number of additional relations [xi, Xj] = 1, thereby 
turning our presentation into one in the category of abelian groups. Then every Wi 
can be viewed as a vector in the group Z d with basis x\,. . .,Xd- We can renumber 
the Wi so that w\,. . . ,Wd generate a subgroup of finite index, at least n, in 1 d . 
Then the vectors w\ , . . . , Wd are the rows of a matrix whose determinant is at least 



Table 1. Different lengths 



Sec. 


groups 


# relations 


length 


plength 


bit-length 


3 
4 
5 

6.1 
6.2 


rank 1 
rank n 
PSL(n,g) 
large n 


0(1) 
0(1) 
0(n 2 ) 
0(1) 
0(1) 


O(logn) 
O(logg) 
0(n 2 logq) 
0(logn + logg) 
0(logn + logg) 


0(1) 

O(e) 
0(n 2 + e) 
0(e) 
0(e) 


O(logn) 

O(logg) 
0(n 2 logn log q) 
0(logn + logg) 
0(logn + \ogq) 
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n, and which is also a polynomial of degree d in the entries of the Wi. Hence, at 
least one of the entries of some Wi is at least n 1 / d /d\, which means that the corre- 
sponding exponent of some Xj appearing in Wi is at least that large. Consequently, 
the length of the presentation is at least n 1 ^ /d\, and hence is certainly not O(logn) 
since d is bounded. 

It follows immediately that various other groups cannot have short bounded pre- 
sentations. For example, the l-dimensional affine group AGL(1,<7) cannot have a 
bounded presentation of length 0(logq). (Namely, if we mod out by the commuta- 
tors of the generators in a presentation for this group then we obtain a presentation 
for a cyclic group of order q — 1 . ) 

1.3. Historical remarks. There seems to have been an effort, at least in the 1930's 
but also as far back as 1897, to obtain presentations for symmetric and alternating 
groups involving as few generators and relations as possible. For example, Moore 
[Mo] gave not only the Coxeter presentation (2.5) for S n but also one with 2 gen- 
erators and approximately n relations. Many such presentations are reproduced in 
[CoMo, Sec. 6.2]. In [CHRR, p. 281] we find the question: "Is there a two-generator 
presentation for A n with k relators, where k is independent of n?" Our Theorem 
A, as well as [BCLO], answer this question. We note that [CHRR] lists various 
presentations of small simple groups, using as one of the criteria for "niceness" the 
short lengths of all relations. 

There has been a great deal of research on presentations for relatively small 
simple groups, and for small-dimensional quasisimple groups. Extensive tables are 
given in [CoMo, pp. 134-141]. Special emphasis is given to PSL(2,p) and PGL(2,p) 
in [CoMo, Sec. 7.5]. An indication of the large amount of more recent work of this 
sort can be found in the references in [CCHR, CHRR, CR1, CRW1, CRW2] . 

On the other hand, there are few general references containing presentations for 
groups of Lie type; cf. [Cur, Stl, St2, Ti2, Theorem 13.32] - described in [GLS, 
Sec. 2.9] - and their consequences [BGKLP, KoLu]. However, there are a few 
specific groups for which presentations have been published; [CRW2] and [CHLR] 
are typical. The only reference containing a hint in the direction of Corollary A' is 
[Wi], which contains a more precise conjecture than the above question in [CHRR], 
namely, that the universal central extension of every finite simple group has a 
presentation with 2 generators and 2 relations. 

Finally, we mention a recent, very different type of presentation for some groups 
of Lie type, initiated in [Ph] and continued very recently in [BS, GHNS, GHS]. 
While these methods can be used to obtain presentations of unitary groups with 
fewer relations than here (see [GKKL2]), in this paper we prefer to deal with all 
classical groups in a more uniform manner. 

2. Elementary preliminaries 

2.1. Presentations. We begin with some elementary observations concerning pre- 
sentations. Functions will always act on the left, and we use the notation g h = 
h^gh and [g, h] = g~ 1 h~ 1 gh = g~ 1 g h ■ 

Let d(G) denote the minimum number of elements needed to generate G. 

Lemma 2.1. If G = (D) is a finite group having a presentation (X | i?), then G 
also has a presentation (D' \ R') such that \D'\ = \D\ and the natural map Fd' — > G 
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sends D' bijectively to D, and \R'\ = \D\ + \R\. In particular, G has a presentation 
with d(G) generators and d(G) + \R\ relations. 

Proof. Write each x € X as a word v x (D) in D and each d € D as a word w d (X) 
in X. Let V(D) = {v x (D) \ x e X}. Let d ^ d' be a bijection from D to D' . 

We claim that (D' \ d! = w d (V(D')), r(V(D')) = 1, d e D, r e R) is a presenta- 
tion for G. For, let G be the presented group, and let tt: Fjj> — > G be the natural 
surjection. 

If if := (7r(V(D'))) < G, then H = G since G is generated by the elements 
Tr(d') = 7r(«;d(V(D / )) eH,deD. 

Also, i? is a homomorphic image of G since 7? is generated by elements tt(v x (D')), 
x e X, satisfying the defining relations for G: if r £ R then r(7r(v x {D'))) — 1 holds 
in G = H. 

Finally, G is a homomorphic image of G since the map d! \— ► c? sends the gener- 
ators of G to generators of G satisfying the same relations as those satisfied in G 
by n(D'). □ 

Lemma 2.2. Let H be a subgroup of index m in G. 

(i) (a) If G has a presentation with x generators and r relations, then H has 

a presentation with m(x — 1) + 1 generators and mr relations. 

(b) If the presentation in (a) has length I, then the corresponding presen- 
tation of H has length at most ml. 

(c) The length of an element of H with respect to the above generators of 
H is at most its length with respect to the original generators of G. 

(ii) If H has a presentation with y generators and s relations, then G has a 
presentation with d(G) generators and md(G) + s relations (independent 
ofy)- 

Proof, (i) Part (a) follows from the standard Rcidcmcister-Schrcicr algorithm [MKS, 
Sees. 6.3, 6.4]: the given presentation for G produces an explicit presentation for H, 
with the stated numbers of generators and relations. Moreover, for (b) note that 
each relation of G of length / gives rise to m relations for H , each of length / in the 
generators of H (cf. [HEO, p. 184]). Finally, for (c) see [Ser, p. 58]. 

(ii) Let tt: F d r G \ — > G be a surjection with kernel N. As in (i), L = 7r _1 (_ff) is a 
free group on a set X of n = m(d(G) — 1) + 1 generators. By Lemma 2.1, H has a 
presentation using the set ir(X) of these n generators together with n + s relations. 
Consequently, N is the normal closure in L - and hence also in -Fd(G) - of a set of 
n + s < md(G) + s elements. □ 

We note that we do not know a length version of part (ii) analogous to (b) or 
(c). 

In the future we will want to identify subgroups of our target group with sub- 
groups of the group given by a presentation: 

Lemma 2.3. Let tt: F X uy -> G = (X,Y \ R,S) and A: F x -» H = (X | R) be the 
natural surjections, where H is finite. Assume that a: G — > Go is a homomorphism 
such that o.(tt(X)) = H. Then (tt{X)) = H. 

Proof. Clearly a sends (n(X)) onto a((n(X))) ^ H. If r e R then r(n(X)) = 1, 
so that A induces a surjection H = (X \ R) — > (tt(X)). □ 
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From now on we will be less careful than in the preceding three lemmas: we will 
usually identify the generators in a presentation with their images in the presented 
group. 

Proposition 2.4. Suppose that G has a presentation (X \ R) in which R has total 
length L. If G is a perfect group such that G/Z = G with Z < Z(G) of prime order 
p, then G has a presentation (X \ R) such that \X\ = \X\ + 1, |i?| = \X\ + \R\ + 1, 
the length of R is less than 4\X\ + 2L + p\R\, and X contains a generator of Z. 

Proof. Since G/Z = G, we can lift the surjection <j>: F = Fx — > G, with kernel 
N := (R F ), to a map 4>: F — > G (just send each x € X to a lift of 4>{x) in G); since 
Z is contained in the Frattini subgroup of G, <j> is surjective. Then K := ker</> < N 
and \N/K\ = p. 

Let X = Xi){y}. We may assume that n G R is not in K, so that N = K{r±). 
For each r £ JJ choose e r such that < e r < p and s r (ri) := rr\ T e K. Then 
Kx := (fi F ) < where # := {t/rf 1 , [j/, or], s r (j/) | a; G X,r e R}. 

We claim that = K. First note that (ri,Ki) is normal in F since [ri, Jf] C 
K\. Then N = (n,Ki) since each r = s r (y)r^ Cr e (n,Ki), so that N/K 1 = 
(nXi) with r? = s ri (y) e Since K x < K and |iV/A"| = p, it follows that 

K\ = F, as claimed. 

Clearly y represents a generator for Z. The total length of _R is at most 
(L + I) + 4\X\ + L + (p - l)\R\. □ 

2.2. Some elementary presentations. The most familiar presentation for the 
symmetric group S n is the "Coxeter presentation" [Mo]: 

^2 g\ ^ n {^1? • • • 5 %n—l I — (^-i^i+l) — (^i^j) 1 

for all possible i and for i + 2 < j < n — 1), 

based on the transpositions + We will often use the following presentation 
based on the transpositions 

(2.6) S n = (x 2 , . .. ,x n | x\ = (xiXj) 3 — (xiXjXiXk) 2 = 1 for distinct i,j,k). 

This presentation is due to Burnside [Bur, p. 464] and Miller [Mil, p. 366] in 
1911; Burnside describes it as probably "the most symmetrical form into which the 
abstract definition [of S n ] can be thrown" , an idea that is crucial for our use of it in 
the next section. Carmichael [Carm, p. 169] observed that this presentation can be 
considerably shortened: only the relations (xiXjXiXk) 2 = 1 with j — i + 1 need to be 
used. Before we learned of (2.6), we had been using the following presentation we 
had found, based on the same transpositions, which may be even more symmetrical 
than the preceding one: 

S n = (x2, ■ ■ ■ ,x n | x\ = (xiXj) 3 = (xiXjXk) 4 = 1 for distinct i, j, k). 

For alternating groups, see [GKKL2]. 

There is an analogue of (2.6) for linear groups. The usual Steinberg presentation 
[Stl] for SL(n,p), n > 3, is as follows (where represents the elementary matrix 
with 1 on the diagonal, i, j-entry 1 and all other entries 0): 

Generators: for 1 < i ^ j < n. 

(2.7) Relations: = 1, [e^, e ik ] = 1, [e^, e k j] = 1, [e^, e k i] = 1, 

[eij, eji] = en for all distinct i,j,k,l. 
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The following variation, suggested by (2.6), allows us to use fewer generators: 

Proposition 2.8. SL(n,p) has the following presentation when n > 4: 
Generators: ey and eji for 2 < j < n. 

Relations: Whenever k are distinct and not 1, = e\ x = [eji,ey] p = 1, 
[ey.eife] = 1, [ea,e k i] = 1, [[ea, ey], e k \] = [[ea, ey], ei fe ] = 1, [[en, ey], ea] = ea 
and [eii, [ea, ey]] = ey . 

Proof. Let := [ea,ey] for i ^ j. Note that [eji,ey] commutes with [efei,eu] 
because it commutes with e k \ and en. It follows easily that (2.7) holds. □ 

This motivates presentations in Theorems 6.1 and [GKKL2]. 

2.3. Gluing. The following lemmas contain the basic idea behind our methodology 
We start with a subgroup T - with a suitable presentation - of our target group G, 
acting on some set with boundedly many orbits. We use this to prove that a small 
number of relations - each arising from one of these orbits - implies a very large 
number of relations. 

Lemma 2.9. Let T = (D \ R) be a presentation of a transitive permutation group 
acting on {1,2, ... ,n} such that D\ C D projects onto generators of the stabilizer 
T\ of the point 1. Assume that every ordered triple of distinct points can be sent to 
(1, 2, 3) or (1, 2, 4) by T. Let Vi be words in D mapping onto permutations sending 
ml for i = 2,3, 4. Then the following is a presentation for a semidirect product 
T K SVi+i {where S n +i acts on {0, 1, . . . , n}): 
Generators: D and z (viewed as the transposition (0, 1)). 
Relations: R, z 2 = 1, [z,D\] = 1, [z,w 2 ] 3 = 1, 
{[z,v 2 ][z,v 3 ]) 2 = ([z,v 2 }[z,v 4 }) 2 = 1. 

Proof. Let G be the group defined by the above presentation. In view of the natural 
map G — > T k 5„+i we can apply Lemma 2.3 to identify T with (D). That map 
sends z T onto the set of all transpositions (0,i), 1 < i < n. Since [z,Z?i] = 1 it 
follows that T acts on z T as it does on {1, 2, . . . , n}. 

The relations z 2 — [z,v 2 ] 3 = ([z, v 2 ] [z, v 3 ]) 2 = ([z , v 2 ][z , V4]) 2 — 1, together with 
the assumed transitivity of T, imply that x 2 = (xy) 3 = (xyxw) 2 = 1 for any distinct 
x,y,w € z T . By (2.6), N := (z T ) S S n+1 . 

Clearly G = TN. Since G maps onto T k 5 n+ i, we have G = T x S n+ i. □ 

Note that, in fact, G = C G (S n+ i) x S n+1 = T x S n+1 . 

Lemma 2.10. Let T = (D \ R), T\, D\, Vi, be as in Lemma 2.9. Then the following 
is a presentation for a semidirect product T tx SL(n + l,p) {where T permutes the 
last n rows and columns) : 

Generators: D, e and / (viewed as eoi and eio, cf. Proposition 2.8). 
Relations: R, [e,£>i] = [/, L>i] = 1, and both e, e" 2 , e V3 , /, /" 2 , f V3 and e,e V2 ,e Vi , 
f,f V2 ,f Vi satisfy the relations for SL(4,p) in Proposition 2.8 (where each of these 
ordered 6-tuples plays the role of ei 2 , ei3, e^, e 2 i, e 31 , e^i). 

Proof. Let G be the group defined by the above presentation. By Lemma 2.3, we 
can identify T with the subgroup (D) of G. 

The natural map G — > T K SL(n+ l,p) sends e T U f T onto the set of all matrices 
eu, en, 2 < i < n. Since [{e, /}, D{\ = 1, it follows that |e T U f T \ — 2n and that T 
acts on both e T and f T as it does on {1,2,..., n}. 
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The relations on e, e V2 , e V3 , /, f V2 , f V3 and e, e" 2 , e" 4 , /, f V2 , /" 4 , together with the 
assumed transitivity of T, imply that the hypotheses of Proposition 2.8 hold for 
N := (e G , f G ). Then N = SL(n + l,p). As above, G is a semidirect product of N 
andT. □ 

3. Symmetric and alternating groups 

In this section we will show that S n and A n have bounded presentations of length 
O(logra). By Lemma 2.2(i), it suffices only to consider the symmetric groups. 

We start by obtaining short bounded presentations for PSL(2,p) (in Sections 3.1 
and 3.2). We use these in order to deal with the case n = p + 2 with p > 3 prime, 
and then deduce the general S n from this case. 

3.1. SL(2,p) and the Congruence Subgroup Property. It is not at all ob- 
vious that even the groups PSL(2,p) have short bounded presentations. Our first 
such presentation is based on results concerning arithmetic groups. The idea of 
using the Congruence Subgroup Property to get bounded presentations of finite 
groups has already appeared in [BM, Lub3]. Here we show that the presentations 
can be also made to be short. In the next section we will give a more explicit and 
far more elementary presentation. 

Proposition 3.1. If p is an odd prime then PSL(2,p) has a short bounded presen- 
tation. 

Proof. We sketch a proof using the Congruence Subgroup Property. By [Serl, Ser2] 
and [Mar, Theorem 2.6], SL(2, Z[l/2]) is finitely presented, has the Congruence 
Subgroup Property and all its non-central normal subgroups have finite index. In 
fact, SL(2,Z[l/2]) has a finite presentation based on its generators 

M .-(; ;>-(-" S)-*-^ ")- 

Adding the additional relation u p — 1 and using the above properties of normal 
subgroups produces a bounded presentation for SL(2,p) for p > 2. While this 
presentation appears not to be short, it can be converted to a bounded presentation 
of length O(logp) by using a group-theoretic version of Horner's Rule [BKL, p. 512]: 

^ 3 ^ If 2 < m < p then u m = Ut( u " H ) h ' 2 = u m ° h^ 1 u m ^ h^ 1 u m2 ■ ■ ■ h^ x u mk h\ 

has length O(logp), where k = [log 4 m] and m = J2 i=0 mj4* in base 4. 
Namely, since u h2 — u A in (3.2), 

(3.4) u m = u m « ■ ■ ■ (u mk ) h2 = u mo h^u mi h 2 ■ h^ 2 u m2 h\ -h^--- h 2 - k u mk h k 2 , 

which collapses as in (3.3). 

This produces a short bounded presentation of SL(2,p). In order to obtain one 
for PSL(2,p), we write —1 as a short word as in [BKL] (cf. Lemma 3.8) and add 
one further relation to kill it. □ 

This method of converting to a short bounded presentation will be used often in 
the rest of this paper. 

We note that the argument used in the proposition can be greatly generalized: 
the Congruence Subgroup Property (combined with Margulis' normal subgroup 
theorem) can be used in a similar manner to provide short bounded presentations 
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for various other families of finite simple groups of fixed rank over suitable fields 
(cf. [Lub3]). 

Sunday [Sun] used the presentation in [BM], which was obtained using the 
Congruence Subgroup Property as in the above proposition, in order to produce 
the following presentation for PSL(2,p) of bounded plength and bit-length O(logp), 
though not of bounded length: 

(u,t \u p = l,t 2 = {utf,{uHu^ +1 ^Hf = 1). 

There is no presentation for this group with smaller \X\ + \R\; by [Sch], this follows 
from the fact that the Schur multiplier of PSL(2,p) has order 2. In view of (3.3), this 
presentation can be modified to a short bounded presentation by including h% as a 
new generator, and using the additional relations u h2 — u A and hit = u 2 (m 2 )'u 2 , 
where 2 = (p+ l)/2 is the integer between 1 and p such that 2-2=1 (mod p). 

3.2. PSL(2,p). We have separated these groups from the general case because 
we need them for symmetric groups and they are less involved than the general 
PSL(2,g). The latter groups require a lot more preparation (in Sections 4.1 and 
4.3.1), due to headaches caused by field extensions. The ideas used in this section 
reappear in Sections 4.3 and 4.4 with many more bells and whistles. 

Fix a prime p > 3 and a generator j of F*. Let 2 be as above, let j be the integer 
between 1 and p such that j ■ j = 1 (mod p), and define 

( 3,, 

Theorem 3.6. If p > 3 is prime then PSL(2,p) has the following presentation 
of length O(logp) with 4 generators and 8 relations, where integer exponents are 
handled using using (3.3): 

Generators: u, hi, h,t (which we think of as the matrices in (3.2) and (3.5)). 
Relations: 

(1) up = 1. 

(2) u h2 = u 4 . 

(3) u h = u j \ 

(4) t 2 = 1. 

(5) h* = h~ 1 . 

(6) t = uu*ti, h 2 t = u 2 (u 2 )'u 5 , ht = vP^fvP. 

Proof. Let G be the group defined by this presentation. Then PSL(2,p) is an 
epimorphic image of G. By (6), G = (u, u 1 }. Then G is perfect since u 3 = [u, /i 2 ] = 1 
(mod G') by (2), while vP = 1. 

We claim that Z := (z) < Z(G), where z := h^- 1 ^ 2 . For, if we write f- 1 = 
1 + mp for some integer m, then (3) implies that u z = ^ <P ]l — uu mp — u by 
(1). Thus [u,z] = 1. Similarly, [u*,z] = 1 since (u 1 )^ 1 = (u 1 )^ by (4) and (5). 

By (1) and (3), in G/Z the subgroup generated by u and h is isomorphic to a 
Borel subgroup of PSL(2,p), while the subgroup generated by t and h is dihedral of 
order p— 1 by (3), (4) and (5). Also, by (6), if k = 1 or j then there are integers 
k', k" and k" 1 such that 

(u k y = u k 'h k "tu k "' . 
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Conjugating these relations using (h) produces the same type of relation whenever 
1 < k < p — 1 (by (3), (h) acts on the nontrivial powers of u with only two orbits, 
with orbit representatives u and 

Now we have verified the standard Steinberg presentation for PSL(2,p) [St2, 
Sec. 4]. (That presentation, as well as more general ones that are also based on 
groups with a BN-pair of rank 1, will be very useful for us; see the beginning of 
Section 4.4 for more details.) Thus, G/Z = PSL(2,p), and G is a perfect central 
extension of PSL(2,p). The only perfect central extensions of PSL(2,p) are itself 
andSL(2,p),sothatG = PSL(2,p)by (4). □ 

Remark 3.7. Changing (4) to [t 2 ,u] = 1 produces a presentation for SL(2,p). 
The presentation for PSL(2,p) at the end of Section 3.1 uses fewer relations. 

For later use, the following observation is crucial [BKL, p. 512]: 

Lemma 3.8. If p > 3 then every element o/PSL(2,p) and SL(2,p) can be written 
as a word of length O(logp) in the generating set {u, h 2 , h, t} used above. 

Proof. Every element can be written as the product of a bounded number of powers 
of u and u* (cf. [BKL, p. 512]). Now use (3.3) and relation (2). □ 

This lemma also holds for the presentation in Proposition 3.1. 

3.3. S p +2- In this section we will obtain a short bounded presentation for S p+ 2 
when p > 3 is prime. 

Start with a short bounded presentation of PSL(2,p) = (X | R) such as the one 
in Theorem 3.6, so that X consists of elements u, h, h 2 , t corresponding to (3.2) and 
(3.5). 

In the action of PSL(2,p) on the projective line {0, 1, ... ,p — 1, oo}, we identify 
a € F p with the 1-space ((a, 1)) of F , so that 

u=(0,l,...,p-l) 
and t interchanges and oo. Then 

B := (u\h) 

is a Borel subgroup fixing while (u, h) fixes oo. There are exactly two orbits of 
PSL(2,p) on ordered triples of points, with representatives (0, — 1, oo) and (0, — s, oo) 
for any non-square s in F*. 

We view S p+ 2 as acting on the set {*, 0, 1, . . . , p— 1, oo}, with PSL(2,p) fixing *. 

Lemma 3.9.5 p+ 2 has the following presentation with 5 generators and 15 relations: 

Generators: X and z (viewed as the transposition (★, 0)). 

Relations: 

(1) R. 

(2) [z,u*] = [z,h] = l. 

(3) z 2 = 1. 

(4) Mf = l. 

(5) (M][z, u ]) 2 = (MM) 2 = i- 

(6) (zu)p +1 = 1. 

Proof. Let G be the group defined by the above presentation. It is easy to see that 
G maps onto S p +2- (The various relations reflect the facts that, when viewed as 
permutations, z commutes with elements having support disjoint from {*, 0}; the 



PRESENTATIONS OF FINITE SIMPLE GROUPS 



15 



supports of z and t have just one common point, as do the supports of z and it; 
and [(*,0)(*,^)(*,0)(*,-'l)] 2 = 1 = [(*,0)(*,^)(*,0)(*,- S )] 2 .) 

By Lemma 2.3, G has a subgroup we can identify with T = (X) = PSL(2,p). 
By (2), z commutes with the stabilizer B — (u*, h) in T of 0. 

By (2)— (5), we can use Lemma 2.9 (with t,u,u s playing the roles of 1)2,1)3,1)4: 
they send oo h 0,-1 h and — s 0, respectively). Thus N := (z T ) = S p+ 2- 
Also, N < (T, z) = G. 

Now u = u p+1 = (zu) p+1 = 1 (mod N) by (6), so that G/N is a homomorphic 
image of PSL(2,p) such that u is mapped to 1. Then G/N = 1 since PSL(2,p) is 
simple. □ 

Remark 3.10. This is a bounded presentation with bounded plcngth and bit- 
length 0(logp). This presentation can be modified to a presentation of length 
0(\ogp) with 0(\ogp) relations (cf. Section 1.2, Example 2). However, while it is 
bounded it is not short due to the long relation {zu) p+1 = 1 (note that u s can be 
made short using (3.3)). Nevertheless, we will use the above presentation to obtain 
a short bounded presentation in the next theorem: we will deduce this long relation 
from shorter ones by using a second copy of PSL(2,p). 

Remark 3.11. One of our main methods has been to use (nearly) multiply tran- 
sitive permutation groups in order to "move" some relations and hence to de- 
duce many others. For example, Lemma 2.9 is concerned with groups that are 
3-transitive or close to 3-transitive. We will use this idea again in Section 6.1 and 
[GKKL2]. Moreover, in [GKKL1] and [GKKL2] we even use sharply 2-transitive 
groups for this purpose rather than more highly transitive groups. 

In preliminary versions of the above lemma, when we sought bounded presenta- 
tions but had not yet approached ones that are both bounded and short, we pro- 
ceeded somewhat differently For example, we used a simple and standard bounded 
presentation for the 3-transitive group PGL(2,p) instead of using its subgroup 
PSL(2,p). 

An even earlier approach to symmetric groups started with a much more com- 
plicated special case, but was nevertheless interesting. First we obtained bounded 
presentations for SL(fc, 2) for infinitely many k, as follows. The Steinberg group 
St4(Z(xi, . . . ,Xd}) over the free associative (non-commutative) ring on d variables 
is finitely presented for every d > 4 [KM] . Consequently, if R is a boundedly pre- 
sented quotient ring of Z(xi, . . . , x<j), then St 4 (i?) is boundedly presented. Note 
that the matrix algebra M t (¥ q ) is boundedly presented (e.g., M t (F p ) = (A,B \ 
A 1 = B l = 0, BA + A t ~ 1 B t ~ 1 = 1, pi — 0); alternatively, a bounded presentation 
for the general M t (¥ q ) can be obtained using the fact that it is a cyclic algebra). 
Therefore, St4(M t (F 9 )) = St 4t (F 9 ) = SL(4t,q) also is boundedly presented. Since 
SL(4i, 2) has boundedly many orbits on 4-sets of nonzero vectors, we can proceed in 
a manner similar to the proof of Lemma 2.9 in order to obtain a short presentation 
for 5*24t_i by using the presentation (2.5) instead of (2.6). 

Remark 3.12. We need the following additional elements o/PSL(2,p) for future 
use: 

• k of order (p + l)/2, acting with two cycles on the projective line (this is a 
generator of a non-split torus), and 
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• I such that 1 and I are representatives of the two B, (k) double cosets in 
PSL(2,p). Thus, 

(3.13) (X) = B(k) UBl(k). 

In order to relate these to our previous generators, we use Lemma 3.8 in order to 
obtain two short relations expressing k and I in terms of X . 

We emphasize that the order of k follows from the relations in Theorem 3.6. We 
now use this observation. 

Remark 3.14. The element zu acts as a (p + l)-cycle (*, 0)(0, 1, 2, . . . ,p — 1) = 
(*,0, 1, 2, ... ,p — 1), and z t (zu)z t is the (p + l)-cycle (0, 1, 2, ... ,p — 1, oo). (Note 
that we are multiplying permutations from right to left.) Let a be a permutation 
of the points of this line such that 

k a = (z t (zu)z t ) 2 and a = (*)(0)(1, i(0), . . . ). 

Such a permutation exists because k fixes * and acts as the product of two disjoint 
(p + l)/2-cycles with representatives and 1(0), while (z t (zu)z t ) 2 fixes * and acts 
as the product of two disjoint (p+ l)/2-cycles with representatives and 1. (N.B. - 
It seems that a cannot be expressed as a short word in our generating set, but this 
will not cause any difficulties because a will not be used in any explicit manner.) 
Thus we have a relation 

(zu) 2 = z^z 1 . 

With all of this notation, we are ready to prove the following 

Theorem 3.15. S p +2 has a presentation of length 0(logp) with 9 generators and 
26 relations. 

Proof. We will use two copies (X \ R) and (X' \ R') of the presentation for PSL(2,p) 
obtained in Theorem 3.6 and Remark 3.12 (where X n X 1 = 0); the natural map 
x x', x e X, extends to an isomorphism "prime" :(X \ R) — > (X' \ R'). We will 
show that S p+ 2 has the following presentation: 

Generators: X U X' and z. (We view z as (*, 0) and X' as X a in the preceding 

remark.) 

Relations: 

(1) RUR'. 

(2) [z,u t ] = [z,h] = [z,u' t '} = [z,h'} = l. 

(3) z 2 = 1. 

(4) Mf^l. 

(5) (M][z, u ]) 2 = (MM) 2 = i- 

(6) [z^u- 1 ] = [zj'- 1 } and (zu) 2 = z l k' z l . 

Relations (6) are especially important since they link the two copies for PSL(2,p). 
Note that the first relation can be rewritten z u — z l 

Let G be the group defined by this short, bounded presentation. By the preceding 
remark, there is a surjection 7r: G — > S p +2- (Note that this uses the permutation a 
that is not actually in the presentation! The first relation in (6) is based on the fact 
that both u^ 1 and (l 17 )^ 1 conjugate (*,0) to (★, 1) since l"(0) = cr _1 (?(0)) = 1.) 

We claim that X and z satisfy the relations in Lemma 3.9. This is clear for 
Lemma 3.9(l)-(5). Moreover, the present relation (6) implies Lemma 3.9(6): 

(zuf +1 = (z*k' V)(p +1 >/ 2 = z'fc' (p+1)/ V - 1 
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since k ,lyP+1 ^ 2 = 1 follows from the relations R' . Thus, by Lemma 2.3 we can 
identify G* := (z, X) with S p +2 in such a way that z is a transposition and |z^ z,x ^ | = 
p+l. Then G* = 

We can also identify (X') with PSL(2,p). In place of (3.13) we will use (X') = 
B'(k') U B'l'- 1 {k'), where B' = (u n ',ti). Since z commutes with B' by (2), it 
follows that 

z (x') = z (k>) u (Z" 1 )^') = z <k>) u (z «- 1 ) <fe'> c 
by both parts of (6). Then 

p+l = \tt(z< z ' X ">)\ < \z^ z - x '^\ < \z< z ' x '>\=p+l, 

so that z^ z - x ') = z^-^ and hence G* = (z< z - x >) = { z {^x,x')) = ( Z G) ^ q 

Clearly, G/G* is a homomorphic image of (X') = PSL(2,p) in which k' is mapped 

to 1, since k' G (X,z) = G* by (6). Therefore G/G* = 1 since PSL(2,p) is simple. 
Replace k' and I' by the words representing them in order to obtain a presentation 

with 9 generators and 26 relations. □ 

We need further properties of the preceding generators in order to handle general 
symmetric groups: 

Lemma 3.16. The following elements of S p+ 2 can be written as words of length 
0(\ogp) in X U X' whenever 1 < i < p — 1: 

tt 4 := (*,0, 1,. \ := (i + 1,.. .,p- l,oo) 

Oi := («, i + 1) r := (*, oo) 

0:=(p-l,oo). 

Proof. In each case we will express the permutation in terms of elements that can 
be moved into one of our copies of PSL(2,p) and hence can be written as short 
words by Lemma 3.8. We start with the elements 

u= (0,l,...,p-l),A;' = 0*(zit)z*) 2 = (0, 1, . . . ,p-l,oo) 2 , z= (*,0), t= (oo, ())■■■ 

inlUl'. Then 

r = z t 9 = (0, oo) u = {z u ) T (*,!) = uzu- 1 

(0, 1) = {uzu^Y Oi = u i (uzu- 1 ) z u- i . 
If 1 < 2j - 1 < p- 1, then 

(oo, 0, 1, . . . ,2j - 1) = (0, 1, . . . ,p- l) 2j (0, 1, . . . ,p- l,co)- 2 ^ = u 2j k'~ j , 
so that 

7T2j— i = z*(oo, 0, 1, . . . , 2j - l)z* = z t u li k'- i z t 

1"2j = 7I"2j-1^2j-l 

Vi-« = (0,l,...,p-l,oo)-«(0,l,...,p-l) 2 J=fc'-Ju« 

A (p-l-2j)-l = ^p-l-2jA p _i_2j- 

By Lemma 3.8, the elements u l , u 2j G (X) and fc'- 7 G (X') can be expressed as short 
words in our generators. □ 

Note that the same argument shows that all cycles + 1, . . . , j), and all per- 
mutations with bounded support, can be written as short words. 
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3.4. S n . Finally, we consider all symmetric groups: 

Theorem 3.17. S n has a presentation of length O(logn) with 18 generators and 
58 relations. 

Proof. By Bertrand's Postulate, if n > 6 then there is a prime p such that 
(n - 2)/2 < p < n - 2. Then i := 2(p + 2) - n - 2 > 1. We will use two 
copies (X | R) and (X \ R) of the presentation of S p+ 2 in Theorem 3.15 (where 
X C\ X = $), and identify them along a subgroup Si+2- 

The map x ^ x, x E X , extends to an isomorphism "tilde" :(X | R) — > (X \ R). 
Then the elements Aj,0,,Aj,f are as in the preceding lemma. We will show that 
S n has the following presentation: 
Generators: X, X. 
Relations: 

(1) RuR. 

(2) z — z, Hi — 7fj. 

(3) [\ i ,\ i ] = [h,f] = [e i ,\ i } = [6i,r] = l. 

Let G be the group defined by this bounded presentation of length O(logn). By 
Lemma 2.3, G is generated by two copies (X) and (X) of S p+ 2- These are linked 
as follows: 

i + 1, z + 2, ... _p — 1, oo 

(3.18) *,0,l,2,...,i-l,i, 

2+1, 2 + 2, ... p — 1, OO 

Namely, the first copy of S p +2 acts on the first and second rows, and the second 
copy on the second and third rows; the relations (2) identify the intersection, which 
is Si+2 acting on the second row. Thus, we will abuse notation and identify s = s 
for s € {*, 0, . . . Note that Sym{i, i + 1, i + 2, . . . ,p — 1, oo} = (Aj, 0j) = S^-j+i 
commutes with Sym {i + 1, . . . ,p — 1, oo,*} = (A»,f) = 5 p _,+i, by (3), since i > 1. 

In order to prove that this defines S n we will use the Coxeter presentation (2.5) 
with the following ordering of our n points: 

(3.19) oo,p — 1, . . . , i + 1, i = i, i — 1, . . . , 2, 1, 0,*, oo,p — 1, . . . , i + 1. 

The first copy of 5^+2 acts from oo to and the second copy acts from i = i to 

r+i. 

For any adjacent points a,b in (3.19), there is a well-defined element <?( a ,&) of G 
that acts as a transposition in at least one of our two symmetric groups S p +2 ■ The 
elements g( a ,b) generate G since they generate both symmetric groups. 

The natural surjection G — > S n maps gt a ,b) to the transposition (a, b). In order 
to prove that G = S n ,it suffices to prove that the elements g( a ,b) satisfy the Coxeter 
relations (2.5). 

Table 2. Pairs of pairs of points 



a, b 


c, d 


reason 


oo, . . . , i 
oo, . . . , i 


oo, . . . ,* 
*, . . . ,i + 1 
i,i - 1, . . . ,i + 1 


[(A i ,^),(A i ,f)] = lby(3) 

Sp+2 
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Consider four points a, b, c, d occurring in this order in (3.19), with a, b and c, d 
two distinct pairs of adjacent points. (We allow the possibility b — c.) We tabulate 
the various cases in Table 2, including the reason that the required relation in (2.5) 
holds. 

Thus, we obtain a short presentation for S n with 2*9 generators and 2*26 + 2 + 4 
relations. □ 

Corollary 3.20. S n has a bounded presentation with bounded plength. 

Proof. Use the preceding presentation without converting the various powers of u 
and k' into short words in the proof of Lemma 3.16. (Of course, we should also use 
Lemma 3.9 in place of Theorem 3.15, since the latter is not relevant here.) □ 

Remark 3.21. Neither n nor log n appears explicitly in this presentation. However, 
n is encoded in the presentation (in binary): p certainly is, and i is encoded through 
the word for A, in Lemma 3.16. In other words, n can be reconstructed from p and 
i, which are visible in the presentation. 

Remark 3.22. For use in Section 6.1, for n > 8 we calculate a number of additional 
elements of S n that can be written as short words in X U X. 

(i) In Section 6.1 we will use the following renumbering of our points: 

*, 0, 1, . . . , i - l,i + 1, . . . ,p - 1, oo, i, . . . ,p - 1, oo = 2, 3, 4, 5, . . . , n, 

where n := n + 1: we will be considering the stabilizer of 1 in the symmetric group 
= Sym{l, . . . , n}. Then the permutations 

z = (2, 3) a := tx^-y^-i = (2, 3, 4, 5, . . . , n) 

z CT ~ 4 = (6, 7) zz CT ~V~ 2 = (2, 3, 4, 5) 

{zz^ 1 z°~ 2 ' z a ^Y x a = (6, 7, . . . , n) [zz a ^ z a ^ z a ^ z^ 1 )- 1 a = (7, . . . , n) 

can be written as words of length O(logn) in our generators. 

(ii) We will also need information concerning parities of some of our generators: 
the stabilizer of the point * = 2 is generated by a set X U X of even permutations 
(the generators of our two copies of PSL(2,p)) together with the odd permutation 
y := z a = (3, 4) that we will now include among our 18 + 1 generators. 

(hi) Since yy a = (3, 4, 5) we see that, in the alternating group ^4 n -i on 
{2,3,4,5, ...,n}, (X,X,?/?/' T ~ 1 ) is the stabilizer of 2, and (u t ,h,X,(yy a ~ 1 ) c7 ~ 1 } 
= (B, X, {yy^ 1 )^ 1 ) is the stabilizer of both 2 and 3. 

Corollary 3.23. Every perfect central extension of A n has a presentation with a 
bounded number of relations and length O(logn). 

Proof. For A n this follows from Theorem 3.17 and Lemma 2.2(i). When n > 6 the 
center of the universal perfect central extension of A n has order 2 [GLS, Theorem 
5.2.3], so that Proposition 2.4 can be applied. □ 

4. Rank 1 groups 

Simple groups of Lie type are built out of two pieces: rank 1 subgroups such as 
SL(2, g), and Weyl groups such as S n - Therefore, as stated earlier, we have placed 
special emphasis on these types of groups. 
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4.1. The BCRW-trick. The following lemma is based on an idea in [CRW1], 
which [KoLu] called the CRW-trick. A year after first using this we learned that 
a similar idea had been used earlier (for a different goal) in [Bau]. We therefore 
rename this the BCRW-trick. This makes remarkably effective use of a simple 
observation: if u,v,w are group elements such that w = uv, then any element 
commuting with two of them also commutes with the third. This observation is 
used to show that, under suitable conditions, a subgroup can be proven to be abelian 
using a very small number of relations (see Section 4.2 for a different viewpoint and 
a generalization): 

Lemma 4.1. Let G be a group generated by elements u, a, b satisfying the following 
relations: 

(1) [a, b] = 1; 

(2) [u,u a ] — 1 and [u,u b ] = 1; and 

(3) each of u, u a , u b can be expressed as a word in the other two. 

Then (u^^) is abelian. 

It is clear that (3) and the first part of (2) imply the second part. We have stated 
the result in the present manner in order to emphasize the symmetry between a 
and b. 

Proof. Define u s ^ := u aSl>t for all integers s,t. It suffices to prove that 

(4.2) u — u ,o and u Stt commute for all integers s,t. 

First note that, by (1) and (3), 

, . If u commutes with two members of a triple wj, Ui+ij, Uij+i 
then it commutes with the third. 

Also, if u commutes with Uij then u b 3a ' — U-i-j commutes with u (using 
(1)). It follows that 

, . If u commutes with two members of a triple u i j 1 
then it commutes with the third. 

For, u commutes with two members of the triple U-i t -j, U-i+i t -j, U-i t -j+i, so by 

(4.3) it commutes with the third and hence with the third member of the triple in 

(4.4) . 

We will prove (4.2) by induction on n := \s\ + \t\. 

When n = 1 our hypothesis states that u = m ,o commutes with ui t o and uo,i; 
these and W-^o and u ^i are the only possible elements having n — 1, and we have 
already seen that u commutes with the preceding two elements. 

Assume that (4.2) holds for some n > 1 and consider nonnegative integers s, t 
such that s + t = n + 1. 

Case: = s<t = n + l. By induction, u commutes with the last two members 
of the triple U\ in , u , n , ui,„-i, hence by (4.4) with the first. Then, by induction 
and (4.3), u commutes with the first two members of the triple u ^, Ui, n , u o,n+i, 
hence with the third. 

Case: 1 < s < t. By induction and (4.4), u commutes with the last two members 
of the triple u s ,t, u s -i,t, u s ,t-i, hence with the first. 
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By the symmetry of our hypotheses, this completes the proof when s, t > 0, and 
hence also when s, t < 0. By symmetry it remains to consider the case s < < t, 
where we use (4.3) and the triple u Sj t_i, Ug+i^-i, u s ,t- d 

We will need various nonabelian versions of the preceding lemma. The most 
straightforward one is as follows: 

Lemma 4.5. Let Uo and Wq be subgroups of a group G, and let u, w, a, b be elements 
of G satisfying the following conditions: 

(1) [a,b] = l; 

(2) each ofu,u a ,u b can be expressed as a word in the other two together with 
elements of Uo; 

(3) each of w 1 w a ,w b can be expressed as a word in the other two together with 
elements of Wo; 

(4) [u,w] = [u a ,w] = 1; 

(5) US = U§ = U and Wg = W 6 = W ; and 

(6) [U ,w] = [u,W ] = 1. 
Then [(u^), (w^)} = 1. 

This time we have not preserved the symmetry between a and b; but note that 
[u b ,w] = 1 by (2), (4) and (6). 

Proof. Define u St t ■= u ^* and w s j := w a3bt for all integers s, t. It suffices to prove 
that 

(4.6) u Si t and w commute for all integers s, t. 

The proof parallels that of Lemma 4.1. This time, 

If u commutes with two members of a triple Wij, Wi+ij, Wjj+i 
„s then it commutes with the third. 

If w commutes with two members of a triple Uij,Ui + ij,Uij + i 
then it commutes with the third. 

For the first of these, by (1), (3) and (5) each of the three indicated elements 
is a word in the others together with elements of Wq — W£ — Wo; now use (6). 
Similarly the second assertion follows from (1), (2), (5) and (6). 

Even more trivially in view of (1), ifu commutes with Wij then 
commutes with w; and if w commutes with Uij then w b 1 a " = W-i,-^ commutes 
with u. It follows that 

, , If w commutes with two members of a triple mj, Ui-\j, Uij-i 
then it commutes with the third. 

Namely, u commutes with two members of the triple W-i t -j, W-i+i^-j, W—^—j+i, 
hence, by (4.7), u commutes with the third, and hence w commutes with the third 
member of the triple in (4.8). 

As before we prove (4.6) by induction on n := \s\ + \t\. 

When n = 0, u = uo,o and w = wo,o commute by (4). 

Consider the case n = 1. By (4), w commutes with the first two members of 
the triple uo.o, «i.o, Wo,i, and hence by (4.7) with Mo,i- Then w commutes with the 
first and last members of the triple ito,i, tt— 1.1, uo.o, an d hence by (4.8) with W-1,1. 
Then w also commutes with the last two members of the triple u-i,o, «o,o, 
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and hence by (4.7) with u_i,o- Finally, w commutes with the first two members of 
the triple ito,o, U-i t o, uo,-i, and hence by (4.7) with uo,-i- 

Assume that (4.6) holds for some n > 1, and consider nonnegative integers s,t 
such that s + t = n + 1. 

Case: = s<t = n+l. By induction, w commutes with the last two members 
of the triple ui ni u rll and hence by (4.8) with the first. Then, by induction 

and (4.7), w commutes with the first two members of the triple u n , ui n , u 0n+ i, 
hence with the third. 

Case: 1 < s < t. By induction and (4.8), w commutes with the last two members 
of the triple u St t, u s —i.t, u St t-i, hence with the first. 

By the symmetry of our hypotheses, this completes the proof when s,t > 0. 
When s, t < 0, proceed as in the above two cases but negate all subscripts. By 
symmetry, it remains to consider the case s < < t, where we use (4.7) and the 
triple u s ,i-i,tia+i,i-i,«a,t to prove that w commutes with u Sjt . □ 

In the proof of Proposition 4.25 we will use this lemma for more than one choice 
of Uo and Wo- The simplest use has Wo = 1 and w in the abelian group Uq. 

4.2. Some combinatorics behind the BCRW-trick. The proof of Lemma 4.1 
was concerned primarily with triples of subscripts. In order to handle Suzuki and 
Ree groups, we will need to generalize this subscriptology. For these purposes, and 
in order both to see the connection with the original BCRW Trick and to use the 
original one, we state the desired situation in some generality. 

Definition 4.9. Let M. be a finite collection of finite sets of vectors in Z fe . 

(1) A set S C Z fe is called closed provided that, for any v E Z k and any M E M, 
if S contains all but perhaps one of the vectors in the translate v + M = {v+m\m E 
M} then S contains the entire translate. 

(2) Fix a length function \v\ > for vectors v E"L k such that each ball about 
contains only finitely many vectors. 

A vector v E Z fe is called reducible if there exist u E Z fc and M E M such that 
v is the unique longest vector in u + M. In this case we will say that v is reducible 
via u + M. 

A vector is called irreducible if it is not reducible. 

(3) The collection M is called of finite type (with respect to the length function 
| • |) if there are only finitely many irreducible vectors in Z fe . 

(4) If a collection is denoted Mx, then M-x denotes {— M \ M E Mx}- 

Example 4.10. In order to relate this to Lemmas 4.1 and 4.5, we use A^bcrw = 
{{(0,0), (1,0), (0, 1)}}, consisting of a single triple in Z 2 , together with 

M ±bcrw — A4bcrwUA4_bcrw — {{(0, 0), (1, 0), (0, 1)}, {(0, 0), (—1, 0), (0, — 1)}}. 
The two collections A'fiBCRW arise naturally in Lemma 4.1, as follows. 

Let u = v and the group H := Z 2 acts on U = (u< a ^) via = u^^ . The set 
-Mbcrw = {M} corresponds to the relation u b — uu a , and hence to the fact that 
each element in {u, u a , u b } — u M — {u m \ m E M} lies in the subgroup generated by 
the remaining elements (as in Lemma 4.1(3)). If v € H then (u M ) v — u v+M . Hence, 
the translates of A^bcrw are exactly the triples of subscripts appearing in (4.3), 
while the translates of A1±bcrw are exactly the triples of subscripts appearing in 
(4.3) and (4.4). 
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Let S := {h £ H \ [u h ,u] = 1}. Since u v+ (°^ = u v u v+(1 ^ for any v e H, if u 
commutes with two of the elements u v+ ^°' 1 \ u v , u"+( 1 ' ) then it commutes with the 
third, so that S is closed under A^bcrw- Moreover, if [u , u] = 1 then [u , u] = 1, 
so that S = —S, and hence S is also closed under -M_bcrw- 

We used the length function |(x,y)| = |x| + \y\ in the proof of Lemma 4.1. 
Moreover, the way we used (4.3) and (4.4) emphasized the fact that Z 2 is the 
smallest 7W±BCRW-closed subset of Z 2 containing the vectors (0, 0), (±1, 0), (0, ±1) 
(a notion generalized below in Lemma 4.13). 

The notion of "reduced" was involved in our induction: commuting with two 
elements of a triple of group elements implies commuting with the third; in the 
induction, the third vector was "longer" than the other two. Our hypotheses gave 
us the above initial set of 5 irreducible elements of Z 2 , from which we "generated" 
all of Z 2 using the notion of closed. 

The situation in Lemma 4.5 was very similar. There we implicitly used S := 
{h € H | [u h ,w] = 1}, and our hypotheses were designed to imply that S is closed 
under -M±bcrw- We will formalize this somewhat more in Lemma 4.12. 

Example 4.11. We now reconsider the set .Mbcrw in the preceding example, 
but this time using the Euclidean length |u| 2 = x 2 + y 2 , v = (x,y). We wish to 
determine when v is irreducible. Note that 

if y < < x then (x, y) reduces via (x — 1, y), (x, y), (x — 1, y + 1), 
if x < < y then (x, y) reduces via (x, y — 1), (x + 1, y — 1), (x, y), 
if x, y < then (x, y) reduces via (x, y), (x + 1, y), (x, y + 1), 
if0<y<x — 1 then (x, y) reduces via (x — 1, y), (x, y), (x — 1, y + 1), and 
if0<x<y— 1 then (x, y) reduces via (x, y — 1), (x + 1, y — 1), (x, y). 
(Compare the proof of Lemma 4.1.) The only vectors (x, y) not covered by these 
observations are (0, y) for y < 0, (x,0) for x < 0, and (x,y) for x,y > and 
|x — y\ < 1, all of which are irreducible. 

For Lemma 4.1 we needed to have a vector (x, y) such that both (x,y) and 
(—x, —y) have the above form. It is clear that for this to happen one of x, y has to 
be and the other one 0, 1 or — 1. This proves that the only irreducible vectors for 
-Mbcrw are the 5 points inside a ball of radius 1 around (0,0). 

The next results deal with the ways these notions will be used later. First we 
formalize the use in a group-theoretic setting already hinted at in Example 4.10: 

Lemma 4.12. Let M. u and A4w be finite collections of finite sets of elements of a 
subgroup H = Z fe of a group G. Let Uq, Wq < G and u,w € G satisfy the following 
conditions: 

(1) If M G M.u and h G M then u h can be expressed as a word in Uq and 
{u l \leM\{h}}; 

(2) If M e Mw an d h € M then w h can be expressed as a word in Wo and 
{w l | I eM\{h}}; and 

(3) [U H ,w] = [u,W H ] = l. 

Then S := {h G H \ [u h ,w] = l} is closed under Mu and A4-w- 

Condition (1) essentially says that, for each M e Mu, there is a relation among 
the elements {u h | h £ M} that can be solved for each of these elements in terms of 
the others. Of course, we will want S to be nonempty, which will always be forced 
by having [u,w] = 1. 
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Proof. We will write H additively. First note that S is closed under M.u- For, 
if h e M e M.u and v e H, by (2) we can express u v+h as a word in Uq and 
I I eM\ {h}}. Therefore, by (3), if w commutes with u v+l for each I E 
M \ {h} then it commutes with u v+h . 

The preceding argument shows that S := {h £ H | [u, w h ] = l} is closed under 
Mw, so that S = —S is also closed under M-w (see Definition 4.9(4)). □ 

Next we observe how the notions "closed" and "finite type" (Definition 4. 9(1), (3)) 
will be used (compare the proofs of Lemmas 4.1 and 4.5): 

Lemma 4.13. Fix a length function as in Definition 4.9(2). If S is a closed subset 
of Z fc containing the set So of all irreducible vectors for M , then S = Z fe . 

Proof. List all vectors in Z fe by non-decreasing length: v\, v 2 , ■ ■ ■ (recall the finite- 
ness assumption in Definition 4.2(2)). Using induction on k we will show that 
Vk € 5\ If f/j, is irreducible it is in So and therefore in 5\ If is reducible then 
there is a translate v + M, v e Z k , M e M, in which Vk is the longest vector. By 
induction the other vectors in v + M arc in 5\ Since S is closed, also Vk € S. □ 

Corollary 4.14. // the set M. is of finite type then there is a finite set So C Z k 
such that, if a subset SCZ 1 is M-closed and contains So, then S = Z, k . 

Although the length function does not appear in the statement of the corollary 
it was needed in the proof. Moreover: 

Remark. If we only consider lengths arising from positive definite bilinear forms 
on Z , then the property that M. is of finite type does not depend on the specific 
choice of length. Of course, the precise number of irreducible vectors does depend 
on the choice of length function. 

4.3. Central extensions of Borel subgroups. In this section we apply the 
results in the preceding sections to go one step closer to simple groups of rank 1: 
in Propositions 4.19, 4.25 and 4.30 we will provide short bounded presentations for 
infinite central extensions of their Borel subgroups. (Recall from Remark 1.2 that 
Borel subgroups do not have such presentations.) 

4.3.0. Polynomial notation. All of our presentations for groups of Lie type involve 
dealing with polynomials over finite fields. 

1. If 7 lies in an extension field of F p , then m 1 (x) will denote its minimal 
polynomial over F p . If 5 e F p py], let fs-,-y{x) £ ¥ p [x] with fs--y(j) — S and deg/,5; 7 < 
deg m 7 . 

2. For any polynomial g(x) = YTo9i xl e an( ^ an y * w0 elements w, ft in a 
group G, define "powers" as follows, multiplying in the stated order: 

(4.15) [[u 9{x % = {u 9 °)(u gi ) hl ■ ■ ■ {u 9 °) h \ 
so that 

(4.16) [[u g{x) ]]h - u^h^u^h^u 92 ■■■h- 1 u 9 'h e . 

If < gi < p then this is a word of length 0(pe) in u, h; (3.4) explains the collapse 
of the exponents here (cf. [BKL, p. 512]). 

Moreover, if < gi < p and some feeG satisfies u h2 — u 4 or u h2 — u 2 , then 
we can apply (a version of) (3.3) for each u 9i , and hence we can write as 
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a word of length O(elogp) in u, hi and h. Note that then depends only 

on the image of f(x) in ¥ p [x}. 

Care is needed with (4.15): we cannot use it to calculate 

i n the obvious manner unless we know that all of the conjugates u h ' 
commute with one another. If this occurs and u p = 1, then [[M s ( x ) +fe ( x )]]/j = 
[[u»(*>]] h [[u*(*)]] h) and [[ufWsW]] h = 1 if [[«**>]]/, = 1. 

We will usually use these formulas when h is such that the conjugates u h ' gener- 
ate an isomorphic image of the additive group of ¥ q and the action of h corresponds 
to multiplication by some 7 on ¥ q = ¥ p [j]. In this case, we have [[u m -< { ^]} h = 1, 
and an isomorphism is given by 

[{u fs -yW]] h for 5e¥ q . 

(Relations used in Propositions 4.25, 4.30 and 4.33 contain variations on this situ- 
ation in nonabelian settings.) 

4.3.1. Special linear groups. Let ( be a generator of F*, where q = p e for a prime 
p. Our first result (Proposition 4.19) requires additional data that exists only when 
q 7^ 2, 3, 5, 9: we need a, b € F* such that 

(4.17) b 2 = a 2 + l and ¥ q =¥ p [a 2 }. 

Such elements exist for the stated q: if £ 2 + 1 and £ 2 — 1 are nonsquares then ( 4 — 1 
is a nonzero square a 2 ; and this generates ¥ q for the stated q since ( 4 is in no proper 
subfield. We emphasize that we will be working with a 2 rather than £; the latter 
will be needed only to obtain an element of the desired order q— 1. See Lemma 4.20 
for a variation on the present theme that does not require the equation b 2 = a 2 + 1 . 
We base our presentation on the following matrices: 

(4.18) u= (o \ )> h *=(*() 1 I) for*e{2,a,6,C}. 

(If p = 2 discard h 2 .) Then B := (u, h^) is a Borel subgroup of SL(2, q), of order 
<?(<?- 1). 

With this preparation, using the polynomial notation in Section 4.3.0 we can 
consider the following presentation. 

Generators: u, I12, h a , hb, h^. (If p = 2 discard h^-) 
Relations: 

(1) [KM = 1 for*,, e {2,a,6,C}. 

(2) u hb = uu ha = u ha u. 

(3) u h2 = u 4 . 

(4) up = 1. 

(5) = 1. 

(6) = [[uA'i. 2 ^)]]^. 

Here and in the remainder of this section we view each of our relations as having 
length 0(\ogq) by using (3.3) and (4.16). 

Proposition 4.19. If q ^ 2,3,5,9, then this produces a bounded presentation of 
length 0(logq) for an infinite central extension of a Borel subgroup B of SL(2,q). 

Proof. Let B be the group defined by this presentation. Using the matrices written 
above it is easy to check that the natural map n: B — > B sending u, h{, h a , hb, hi 
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to "themselves" is a homomorphism. In particular, (2) arises from the fact that hb 
acts on ( J i ) as multiplication of * by b 2 = a 2 + 1 . 

By (1), (2) and Lemma 4.1 (with h a and hb in place of a and b), U := (u^ ha ' > ) = 
( U KK | i^ j g Z) is abelian, and hence is an elementary abelian p-group by (4). In 
particular, the definition of [[u 9 ^]]/i a is independent of the order of the terms for 
g(x) € ¥ p [x]. 

Clearly h a acts as a linear transformation on U. By (5), its minimal polynomial 
divides the irreducible polynomial m a i (x) . Since the images of u under powers of 
this linear transformation span U, we have \U\ < q, and then we obtain equality 
using 7r. 

Now 8 i ► [[u^ a;a2< ' a: ' l ]]/i a is an isomorphism — » U. Since fs a 2 ;a 2 { x ) = xfs ;a 2 ( x ) 
(mod m a 2(x)) by the definition of fs- a 2 ( x ) m Section 4.3.0, this map extends to an 
F p [x]-module isomorphism with x acting on ¥ q as multiplication by a 2 and on U 
as conjugation by h a . (Note that there are choices for this isomorphism: in place 
of a 2 we could use any conjugate of it by a field automorphism.) 

By (6), the linear transformation induced by on U has order (q— l)/(2,q — 1). 
Thus, /i£ acts on U exactly as occurs in B. All h± do as well. It follows from (1) 
that B is a central extension of B, as required. 

In view of (4.15), this presentation involves integer exponents that can be much 
larger than logp. However, using (3) these can be shortened as in (3.3). Similarly, 
exponents that are polynomials are dealt with in (4.16) using short words. Thus, 
this presentation produces one that is short and bounded. 

Finally, there is a semidirect product C/xZ that satisfies the above relations: we 
are dealing with an infinite group. □ 

Note that 

Z(B) = (h d ( ,h 2 h- n{2 \h a h- n(a \h b h- n{h) ), 

where d = |C 2 | = (q - l)/{2,q- 1) and C" W = * for * e {2, a, b}. Almost all of 
Z{B) will disappear when we deal with simple groups. 

As we saw above, we can choose a = ( or b = ( or b = ( 2 ; correspondingly 
h a = or hb — h^ or hb = h 2 . This allows us to obtain a shorter presentation by 
removing one of the generators h a or hb, and hence also three relations in (1). 

The following is a simple variation on this proposition. 

Lemma 4.20. If q ^ 2, then the following produces a bounded presentation of 
length 0(\ogq) for an infinite central extension of the 1- dimensional affine group 
AGL(1, q) = {x ^ 1X + 6 | 7 e F*, S e F,}: 

Generators: u, h' 2 , h' ( -,h' ( - +1 . (If p = 2 discard h' 2 -) 

Relations: 

(1) [^,ft'.] = lfor*,.e{2,C,C + l}- 

(2) u^c+i = uu h 'i = u h '(u. 

(3) u h 2 = u 2 . 

(4) up = 1. 

(5) [[u m ^)]] h , = 1 ( C f. (4.15)). 

Proof. This is similar to the preceding proof, using the same u but based on the 
new diagonal matrices = (J 2) f° r * € {2, C, C + !}■ □ 



PRESENTATIONS OF FINITE SIMPLE GROUPS 



27 



4.3.2. Unitary groups. We now turn to nonabclian versions of the same theme: the 
unitary groups SU(3, q). This time let F = ¥ q 2, where q = p e and F* = {() as 
usual. We will use the following matrices that preserve the form X1X3 + x 2 x 2 +x$xi: 

/l a [3\ (1/2 0\ /5- 1 0\ 

(4.21) u(a,/3)= 1 -a , h 2 = 1 , h 5 = 5/8 1, 

\0 I J \ 2/ \ <5/ 

for any field elements such that (3 + (3 — —aa and 5^0. Let U be the set of all 
matrices u{a,(3); its center has a = 0. Moreover, 

(4.22) u(a, 0) h < = u(C 29 ~V C 9+1 /?), and u(a, (3) h2 = u(2a, 4/3) if p ^ 2. 

Then B = U xi (h^) is a Borel subgroup of SU(3,g). Its structure is as follows: 
|f/| = g 3 , Z(J7) = U' = $(U) is elementary abelian of order g, and U/Z(U) is 
elementary abelian of order g 2 ; the action of on {/ is given above, and it has 
order (q 2 — l)/d in that action, where d = (2g — 1, g 2 — 1) = (3, (7 + 1). 
We start with a property of F somewhat analogous to (4.17): 

Lemma 4.23. If q ^ 3,5 and g is odd, then there are elements a,b e F satisfying 

(4.24) a 29 " 1 + fc 2 ^ 1 = 1 and a q+1 + b q+1 = 1, 

suc/i tftaf F = Wpia 2 ^ 1 } and ¥ q = F p [a« +1 ]. 

If q = 2 e with e > 2, then there are elements a, b G F satisfying (4.24) such that 
F, =F p [a« +1 ] =F p [a 2 «- 1 ]. 

We give a proof in Appendix A. 

Proposition 4.25. There is an infinite central extension of the above Borel sub- 
group B of SU(3, q) having a bounded presentation of length O(logg). 

Proof. Until the end of this proof we assume that q 7^ 2, 3, 5. We use the abbre- 
viations 7' = 7 9+1 , 7" = 7 2<J ~ 1 for 7 e F. Section 4.3.0 contains the polynomial 
notation used below. 

Fix elements a, b e F as in the preceding lemma. We base our presentation on 
arbitrary elements u = u(a ,[3 n ) ^ Z(U) and l^me Z(U). 

We will use the following presentation with (at most) 6 generators and 20 rela- 
tions. 

Generators: u, w, h 2 , h a , hb, h^. (If p = 2 discard h 2 .) 



Relations: 




(1) 


[h*, h,] = 1 for *, • e {2, a, b, (}. 


(2) 


w = w a w b = w hb w ha . 


(3) 


ho 4 
w 2 = w . 


(4) 


w p = 1. 


(5) 


[[w m -' (x) ]]ha = 1. 


(6) 


[[w f <'»'W]]h a =W k <. 


(7) 


U = U ha U hb W\ — u hb u ha w 2 . 


(8) 


[u, w] — [u ha , w] = 1. 


(9) 


u h 2 _ U 2 W3 ^ 


(10) 


M P = W4. 


(11) 




(12') 


[\u f <"'"" (x) \]h a = u h <w 6 if q is odd. 
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(12") {[[u f «-.°"l x ^] h J h <[[u f 0-.*''W]] ha = u h lw 6 if q is even and C" satisfies 
C" 2 = a(" +/3for a,/3e¥ q . 
(13) [u,u h <] — wj and [u ha , u h z] — ws if q is even. 

Here wi, . . . , w$ are elements of W :— (w^ ha ' ) ); they are determined by £ 
and our chosen matrices u, w and h±. 

There is a surjection 7r : B — > £> from the group _B defined by this presentation 
onto B; this is the significance of the specific elements Wi. Note that (12") uses the 
fact that (" is not in ¥ q , and hence has a quadratic minimal polynomial over ¥ q . 

By (1), (2) and Lemma 4.1, W is abelian (using /i a , hj, in place of the elements 
a, b in that lemma) and hence elementary abelian by (4) . 

Using 7r we see that \W\ > q. Now (2), (5) and (6) let us identify W with 
the additive group of ¥ q in such a way that each h+ acts as multiplication by a 
conjugate of *' = * 9+1 . 

By (1), (7) and (8), we can apply Lemma 4.5 to deduce that [u, W] = 1 (using 
Uct = W, Wq = 1 and h a , in place of the elements a, b in that lemma). 

If U := (u< h '>,W), then [U,W] = 1 since W< h " h "> = W by (2). 

In view of (7), another application of Lemma 4.1 shows that U/W is abelian, 
and by (10) it is elementary abelian. 

Using 7T we see that \U/W\ > ^[a 2 "" 1 ]!. Now (11) lets us identify U/W with 
F p [a 2,J_1 ] in such a way that each h+ acts as multiplication by = ■k 2g ~ 1 . 

Case q odd. Since F = W q [C], (12') implies that acts irreducibly on U/W, and 
hence \U/W\ < q 2 . Thus, \U\ = q 3 . Since W = ®(U), the action of h c on U/W 
forces its action on W. Since U < B, by (1) it follows that B is a central extension 
of B, as required. 

Case q even. We use Lemma 4.5 in order to bound \{U, U h< ) /W\: in that lemma 
we use G — (h a , hb, U, U h< )/W, also uW and u hc -W in place of of u and w, and 
finally Uo = Wo = 1. The only condition that needs to be checked is Lemma 4.5(4), 
and this is just (13). Thus, [u^' h ^ , uM^.M] < w. It follows from (1) that 
[U,u h <] < W since U = (u< ha >,W). 

Then \(U, U h <)\ < q 3 by (1), and hence equality holds using tt. By (12"), 
(U, U h <) h < = (U, U h z). Now we can proceed as in the preceding case, using (U, U h <) 
in place of U. 

Once again B is infinite. There are at most 20 relations: 20 if q is odd, and only 
17 if q is even since we delete 3 from (1) together with (3) and (9). 

We still need to show that the length is O(logg). By (3), (3.3) and (4.16), 
each element of W has length O(logq) in our generators. Similarly, by (9), (10), 
(3.3) and (4.16), every element in (U,U hc -) /W has length 0(log<7), and hence the 
relations (11), (12') and (12") can be viewed as short. 

This completes the proof of the proposition, but for future reference we include 
presentations for the omitted cases q = 2, 3, 5. When q = 2, B = Q 8 x C3. For the 
remaining q we use the irreducibility of on Q/Z(Q). 
Generators: u,w,h^. 
Relations: 

(1) w p = 1. 

(2) [u,u h <] = w. 

(3) up = 1. 

(4) [u,w h l] = 1, i = 0,1. 
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(5) [[w m <'W]] h( = 1. 

(6) [[u m <"W]] h( = Wl . 
Here, wi G W := (w^). 

In the group with this presentation, W is cyclic of order q — p, and by (5) it is 
normalized by h^. By (4), W commutes with U := (u^ h <^). By (2), (3) and (6), 
(U, W)/W is elementary abelian, and we can proceed as before. □ 

4.3.3. Suzuki groups. We start with analogues of (4.21) and (4.22), based on [Suz, 
p. 133]. This time, F = ¥ q , q = 2 2k+1 and F* = ((). Let 9: x ^ x 2 " +1 be the 
field automorphism such that 9 2 = 2. Then B = U x (h^), where U consists of all 
(a, /?) G F 2 with multiplication rule 

(4.26) (a, /3)( 7 , 6) = (a + 7 , p + 5 + aj e ), 

and W := Z(U) = U' is all (0,/3). Moreover, if e G F* let h e denote the automor- 
phism of U defined by 

(4.27) (a,(3) h * = (ea,e e+1 (3). 

Then (h^) is transitive on the nontrivial elements of both Z(U) and U/Z(U). 
If x G U and v = (i, j, k, I) G Z 4 we write 

(4.28) x v = x ( ^^ l) = x h where h = 

By (4.27), the following trivial identities in the field F 

(0 + i = (C + 1) (C 9 ) + i = (C e + i) 

translate into relations modulo W: 

u (o,o,o,o) u (i,o,o,o) = u (o,o,i,o) w (o,o,o,o) u (o,i,o,o) = u (o,o,o,i) 

for any u e U. These relations correspond to the following collection Mu of subsets 
ofZ 4 :' 

{{(0, 0, 0, 0), (1, 0, 0, 0), (0, 0, 1, 0)}, {(0, 0, 0, 0), (0, 1, 0, 0), (0, 0, 0, 1)}}. 
Since 6 2 = 2 we have A^" 1 )^ 1 ) = \ and \(2S)(e+i) = \& f or any \ e p. Then 

(C 1 ) 9+1 (( 9 ) 9+1 + 1 = ((C + i)-y +1 (C 9 + i) e+1 

(C 2 )9 +I ((c fl ) -I ) fl+1 + 1 = ((C + l) 2 ) 9+1 ((C e + l)" 1 )^ 1 , 

which, by (4.27), imply that 

w (0,0,0,0 )l() (-l,l,0,0) =lc (0,0,-l,l) and ^(0,0,0,0)^(2,-1,0,0) = ^(0,0,2,-1) 

for any w G W . These correspond to another collection A4w of subsets of Z 4 : 
{{(0,0, 0,0), (-1,1,0,0), (0,0,-1,1)}, {(0,0,0,0), (2, -1,0,0), (0,0,2,-1)}}. 

Our presentation of a Borel subgroup of Sz(q) uses Lemma 4.12 to show that U 
is nilpotent of class 2. This relies heavily on the observation that the collections 
Mu^M-u, MwL)M-w and Mu^M-w arc °f finite type. This is almost trivial 
for the first two collections because they contain copies of .M±bcrw up to a linear 
transformation. The next lemma, whose proof is in Appendix B, gives the same for 
the last collection. 
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Lemma 4.29. Let A4s z := A4u U A4-w consist of the following four triples of 
vectors ofZ 4 : 

(a) = {(0, 0, 0, 0), (1, 0, 0, 0), (0, 0, 1, 0)} (b) = {(0, 0, 0, 0), (0, 1, 0, 0), (0, 0, 0, 1)} 

(c) = {(0, 0, 0, 0), (1, -1, 0, 0), (0, 0, 1, -1)} (d) = {(0, 0, 0, 0), (-2, 1, 0, 0), (0, 0, -2, 1)}. 

Then Ms z is of finite type and there are exactly 139 irreducible vectors for Ms z , 
with respect to the length function \(xi, x 2 , x 3 , x A )\ 2 = x\ + 2x\ + x\ + 2x\ for 
(x 1 ,x 2 ,x 3 ,X4,) e Z 4 . 

Moreover, there is a set Sq C Z 4 consisting of 16 vectors such that, if S C Z 4 
contains Sq and is Msz-closed, then S = Z 4 . 

The proof of the first part of the lemma involves tedious computations requiring 
only high school algebra and bookkeeping. More effort is needed for the refinement 
in the second part (see Appendix B). 

Proposition 4.30. There is an infinite central extension of the above Borel sub- 
group B o/Sz(g) having a bounded presentation of length O(logg). 

Proof. We base our presentation on an arbitrary choice of u € U \ Z(U) and on 
w = u 2 e Z(U). Let So denote the set of vectors appearing in Lemma 4.29. We 
will use the following 5 generators and 17 + |So| =33 relations: 

Generators: u, h± for * e {C, ( e , C + 1, C° + !}■ 

Relations: 

(1) [K, h.] = 1 for *, . e {£ C 9 , C + 1, C e + !}• 

(2) w 2 = 1, where w := u 2 . 

(3) w (°' = wwC-bbO.o) in the notation G f (4.28). 

(4) ^(0,0,2,-1) =wu; (2,-l,0,0). 

(5) \wM~ lxm ) = 1- 

(6) [[w m ^]] a = 1 where a := h^h c e. 

(7) yji 1 * ' ' ) = [[ w x2 +1 ]] a , where x 2 * is reduced mod m^(x) in order to 
obtain a short relation. 

(8) u (i.o,o,o) =uu (o,o,i,o) u;i . 

(9) u (o,i,o,o) = tttt (o,o,o,i) Wj . 

(10) [u {l ^^ l \w] = 1 for every (i,j,k,l) e S . 

(11) [u,u^ '°^}=w 3 . 

(12) [[u m <W]] h( = w 4 . 

(13) w (°'b0,0) = [[u x 2k+1 ]]h(W5 . 

Here, toi, . . . are suitable elements of W :— (w^) determined by our choice 
of u and £. 

First of all, these relations are satisfied in the Borel subgroup B: if w = (0, (3) 
then (0,/3)V' l c 1 = (0, Q[3) by (4.27), which yields (6) and (7); (4.27) also yields 
(12) and (13). We have already seen (3), (4), (8) and (9), while (1), (2), (5), (10) 
and (11) are clear. 

By (1), (3), (5) and Lemma 4.1 (with a as above and b := h^ 1 hQe +1 ), W is 
abelian. By (2), (3) and (6), W can be identified with F in such a way that a and 6 
act as multiplication by C and £ + 1, respectively. By (7), acts as multiplication 
by ( e+1 , which together with (3) and (4) implies that h* acts on W as for 

*e{C,C e ,C + i,C ( ' + i}- 
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Once again we use the notation (4.28). We proceed essentially as in Section 4.1. 
Let 

S:={veZ i \ [u v ,w] = 1}. 

Let Ms z be as in Lemma 4.29. 

By Lemma 4.12 (with U = W and W = 1), combined with (3), (4), (7), (8) 
and the fact that [w, W] = 1, the set S is A^sz-closed (Definition 4.9(1)). Since S 
contains So by (10), we have S — Z 4 by Lemma 4.29, so that [u^ % ^' k ' l \w] = 1 for 
all {i,j,k,l) e Z 4 . Thus, each [u,u;W> fc >0] = 1 and hence [u,W] = 1. 

It follows that W = W h< - commutes with U := (u< fc <>). By (1), (8), (9) and (13), 
U is invariant under each h*, and hence contains ((u 2 )^} = (w^) = W. Thus, 
W < Z(U). 

By (1), (8) and (11), we can apply Lemma 4.1 to U/W. Then this group is 
elementary abelian since w = u 2 . By (12), \U\ < q 2 . As in Proposition 4.25, it 
follows that the presented group B is a central extension of B. Once again B is 
infinite. □ 

4.3.4. Ree groups. We cannot prove Theorem A for the Ree groups 2 G2(3 2fe+1 ), but 
we can at least handle their Borel subgroups. Let F — F 3 2*+i, 9: x i— > x 3 + and 
F* = (C). By [KLM], B = U x (h c ) where U = {x(a, b, c)\a,b,ce F} and 

(4.31) x(ai, bi, ci)x(a,2, 62, c 2 ) = x(ai + a 2 , h + b 2 + a\a 6 2 , 

Ci + c 2 - a\a° 2 + a 2 bi + aia e 2 +1 ) 

(4.32) x(a, b, c) h < = x(( 2 - e a, C e_1 6, Cc). 

Proposition 4.33. There is an infinite central extension of a Borel subgroup B of 
2 G 2 (q) having a bounded presentation of length 0(\ogq). 

A sketch of a proof is given in Appendix C. These groups are harder to work 
with than previous ones. In particular, whereas we used Z 2 for SL(2, q), and Z 4 for 
Suzuki groups, for these Ree groups we use Z 6 . 

4.4. Presentations for rank 1 groups. We are now ready to consider the rank 1 
groups SL(2,<7), PSL(2,<?), PGL(2,q), PSU(3,q), SU(3,q) and Sz(q). Except for 
headaches due to field extensions or nonabelian p-groups, the idea in this section is 
the same as the one in Section 3.2 

In each case there is a Borel subgroup B = Ux\{h) 7 with U a p-group. There 
is an involution t (mod Z(G) in the case SL(2,g) with q odd) such that h* = h^ 1 
(or h~ q in the unitary case). These lead us to the Steinberg presentation for these 
groups [St2, Sec. 4]: 

Theorem 4.34. (Steinberg presentation) Each of the above groups has a presen- 
tation using 

(1) any presentation for B, 

(2) any presentation for (h,t), and 

(3) \U\ — 1 relations of the form 

(4.35) Uq — u\hotu 2 , 

with u ,ui,u 2 nontrivial elements ofU and ho G (h) (one such relation for 
each choice ofu n ). 
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We have already used a special case of this presentation in Section 3.2. We now 
use this presentation in order to prove the following 

Theorem 4.36. (a) All of the groups SL(2, q), PSL(2, q), PGL(2, q), PSU(3, q), 
SU(3,<7) and Sz(g) have bounded presentations of length O(logg). 
(b) Each element of each of the preceding groups can be written as a word of 
length 0(\ogq) in the generating set used in the above presentation. 

Proof. We will treat the groups separately. In each case we will show that we only 
need some of the relations (4.35) in order to deduce all of those relations. 

4.4.1. Special linear groups. Here we will be more explicit than in the unitary or 
Suzuki cases. We may assume that q > 9. We will use the notation leading up to 
Proposition 4.19. We will show that SL(2, q) has the following presentation using 
(at most) 6 generators and 22 relations. 

Generators: it, t, h 2 , h a , hb, h^. (If p = 2 discard h 2 .) 

Relations: 

(1) [KM = 1 for*,, e {2, a, 6,0- 

(2) u hb = uu h ° = u ha u. 

(3) u h ? = u 4 . 

(4) u p = 1. 

(5) [[u m *'W]] ha = 1. 

(6) U h < = [[u f <*:>*W]] ha . 

(7) hl = h^ for*e{2,a,6,C}. 

(8) [t\u] = l. 

(9) t — utfu. 

(10) Kt = \[u f >-^]\ ha ■ [[u f *-^\y ha ■ [[u f ""'W]] ha , where . - *" x for 
*G {2,a,6,C}. 

This certainly involves more relations than one might expect ([CRW1] uses at 
most 13 relations) . In [GKKL2] we give a similar presentation (of bit-length 0(log q) 
but not of length O(logg)) using only 9 relations. 

In order to show that the group G presented in this manner is isomorphic to 
SL(2, q), note that SL(2, q) is a homomorphic image of G such that t is in (3.2) and 
B := (u, h 2 , h a , K, h^) (which we identify with a subgroup of G) maps onto a Borel 
subgroup B of SL(2, q). By (l)-(6), we can apply Proposition 4.19: B is isomorphic 
to a central extension of B. 

In particular, U := (u^°^) is a normal subgroup of B, and h a acts irreducibly 
on U. Then U = [U, K] < B' < G' , so that (U, U*) < G' . By (9), t e (U, £/*), and 
then all K <S (U, U*} by (10), so that G = (U, U 1 ) and G is a perfect group. 

In the identification of U with ¥ q in Proposition 4.19, acts as multiplication 
by C 2 . By (7), since h^ 1 1 ^^ 2 ' 9 1 - ) centralizes U it also centralizes U* and hence G. 

Similarly, by (8) we have Z := (h[ q ~ 1)/(2 ' q ~ 1) , t 2 ) < Z(G). 

We claim that G/Z satisfies the presentation for PSL(2,g) in Theorem 4.34. 
For, (h ( ,t)/Z is dihedral of order 2(q - l)/(2,q- 1) by (7). Also, U(h c ,t 2 )/Z 
is isomorphic to a Borel subgroup of PSL(2,g). Moreover, (h^) acts on the non- 
trivial elements of U with at most two orbits; orbit-representatives are u 1 and 
[[u f( - a2{x) }]h a - By (9) and (10), relation (4.35) holds for u = u 1 or [[u s ^ 2{x) )] ha ; 
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conjugating these relations by (h^) produces all of the required relations (4.35). 
This proves our claim. 

The only perfect central extensions of PSL(2, q) arc PSL(2, q) and SL(2, q) (since 
q > 9). Hence, G SL(2, 9 ). 

This presentation is clearly bounded. It can be made short: relations (5), (6) 
and (10) have length Oilogq) by (4.16) and (3.3). (Note that the generators h a 
and hb are not needed here.) 

We still need to verify Theorem 4.36(b). Every element of SL(2,q) is in 
UU t UU t U . Thus, we only need to verify (b) for U, and again this follows from 
(4.16) and (3.3). 

Case PSL(2, q). Replace the previous relation (8) by t 2 = 1. 

Case PGL(2, q). This is proved as in the preceding case, using Lemma 4.20 in place 
of Proposition 4.19. 

4.4.2. Unitary groups. We may assume that q > 5. We use the matrices in (4.21), 
together with the additional matrix 

/0 1 

(4.37) t = -1 

\1 

In the notation of (4.21), if (3 ^ and (3 + + aa — then (4.35) becomes 

(4.38) u(a,/3y = u(-a/0,l//3)h tu(-a//3,l//3). 

We will make fundamental use of a result of Hulpke and Seress [HS] , who showed 
that the presentation in Theorem 4.34 can be shortened by using (1), (2) and a 
carefully chosen set of at most 7 of the relations (3). 

The following presentation uses (at most) 14 generators and 52 relations. We 
will show that it defines a group G that is isomorphic to SU(3,q). 

Generators: u, w, t, h 2l h a , fit, hi for 1 < i < 7. (If p = 2 discard h 2 -) 
Relations: 

(0) All relations (1)-(13) used in Proposition 4.25. 

(14) t 2 = 1. 

(15) h± = v±ivl 2 v±3t for * e {1, 2, a, b, (}, where hi := 1. 

(16) u h * = [[u f *W]} h( for*e {2, a, 6,0- 

(17) u\ — unhitui2 for 1 < i < 7, relations due to Hulpke and Seress [HS]. 

(18) [h c ,h\] = l. 

(19) [h c ,hi] = 1 for 1 < i < 7. 

(20) u hi = [[u^]] h< and u k ^ = [[u g i^]] hi . for 1 < i < 7. 

Here Ui, un,Ui2, v±i,v±2,v±3 are specific elements of U :— (u^ h <' ) ), and /*(x), 

9i( x ), 9i( x ) are specific polynomials in F[x}. 
There is a surjection tt: G — > SU(3,<7) taking each of the generators to "itself". 
This is clear for (0) and (14), while (15) comes from (4.38). Relations (16) and (18)- 
(20) encode the fact that, in SU(3, q), h* and hi are powers of h^ 7 while hi = h± q 
and h\ — h { q . There are at most 7 relations of the form (17) in the presentation 
for SU(3, q) given in [HS]. (All of these relations are given far more explicitly in 
[BGKLP, (11) and (12) on p. 103] and [HS], using (4.38).) 
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Moreover, ir maps B := (u, w, /12, h a , hb, h^) (which we identify with a subgroup 
of G) onto a Borel subgroup B of SU(3, q). Proposition 4.25 implies that B is a 
central extension of B. In particular, since q > 5, U is a normal subgroup of B of 
order q 3 and acts irreducibly on its Frattini quotient U/Z(U). It follows that 
U = [U, h c ] < G'. By (15) for h u we have t G (U, £/*}, and then all K, h e (U, U f ) 
by (15) and (17), so that G — (U, U l ) and G is a perfect group. 

In particular, in the identification of U/Z(U) with ¥ q 2 in Proposition 4.25, acts 
as multiplication by £ 2?_1 (cf. (4.22)), and hence has order d := (q 2 — l)/(3, q + 1) 
in its action on U. 

By (16) and (18), /i£ also acts on U, and is irreducible on U/Z(U). Thus, 
U=(u< h <>). 

Let Z = Z(G). We claim that ft- € (h()Z if /i is any of the elements 
By Proposition 4.25, (19) and (20), h normalizes U and there is an integer k such 
that 1 < k < d and hh^ centralizes U. By (16), (19) and (20), since h l commutes 

with h 1 ^ it also normalizes U = (u^ h <^), and there is also an integer k' such that 
1 < k' < d and (hh^ )* centralizes U. Since n(G) — SU(3,q), we have k = k', so 
that hh^ centralizes (U, U*) = G, which proves our claim. 

We can now use Theorem 4.34 to show that G/Z = PSU(3, q). For, (16) implies 
that (h£,t,Z)/Z has order 2(q 2 — l)/(3,q + 1), and hence is isomorphic to the 
required subgroup of PSU(3,g). Also, BZ/Z = U(al\K)Z/Z = U(h c )Z/Z is 
isomorphic to a Borel subgroup of PSU(3,(7). The only relations remaining to be 
checked for G/Z are those in (4.35), and these are (17) (read modulo Z). 

Since SU(3,g) is an epimorphic image of G, it follows that G = SU(3,g) (see 
[GLS, Corollary 6.1.7]). This presentation is clearly bounded. It can be made short 
using (4.16) and (3.3). 

Finally, Theorem 4.36(b) holds when g € U by (4.16) and (3.3), then also when 
g e U\ and hence whenever g e 1717*1117*11 = SU(3, q). □ 

By Theorem 4.36(b), we also obtain a short bounded presentation for PSU(3, q) 
since a generator of Z(SU(3, q)) has length 0(\ogq). 

4.4.3. Suzuki groups. We may assume that q > 8. This time we will use a re- 
markable result of Suzuki [Suz, p. 128], who showed that the presentation in Theo- 
rem 4.34 can be shorted by using (1), (2) and a single carefully chosen relation (3) 
(in fact, this is just the relation needed to define Sz(2)). 

We use the following presentation having 7 generators and 43 relations. 

Generators: u, w, t, h^, h^ + i,h^e, h^o +1 . 

Relations: 

(0) All 33 relations (1)-(13) in Proposition 4.30. 

(14) t 2 = 1. 

(15) h* = K 1 for * e {£ C + 1. C e ,C e + !}• 

(16) t — U1U2U3, a relation due to Suzuki [Suz, (13)]. 

(17) M - «*i«* 2 u*3 for * e {C, C + 1, C 9 , C e + 1}- 

Here u\, 112, 113, u±i, u±2, u*3 are specific elements of U := (u^ h ^) that de- 
pend on the choices of £ and u in Section 4.3.3. 

Note that, in the Suzuki group case, we do not need additional relations as in 
Section 4.4.2(19), since the power of occurring in relation (16) is 1. 
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It order to show that the group G presented in this manner is isomorphic 
to Sz(q), first note that Sz(g) is a homomorphic image of G such that B := 
(u, h^, h(; + i 7 h^e, h^e + i) projects onto a Borel subgroup B of Sz(g). (For more ex- 
plicit versions of (16) and (17), see [Suz, (13) and (38)].) 

The rest of the proof is essentially the same as in Sections 4.4.1 and 4.4.2. Both 
U (h^) / {h 1 ^ 1 ) and (h^,t) / (h^ 1 ) are as required in Theorem 4.34. Suzuki [Suz, p. 
128] proved that a single relation (16) is the only one of the relations (4.35) needed 
in order to deduce all q 2 - 1 of them. Hence, G/Qijr 1 ) = Sz(q). Finally, G ^ Sz(q) 
since G has no proper perfect central extensions (recall that q > 8; cf. [GLS, pp. 
312-313]). 

This presentation can be made short by using (4.16) and (3.3). It remains to 
consider Theorem 4.36(b). Each element of U or [/* is a word of length O(logg) in 
our generators. Now (b) follows from the fact that Sz(q) = U^UU 1 !/. 

This completes the proof of Theorem 4.36. □ 

We have not dealt with the Ree groups in Theorem A. The obstacle to both 
short and bounded presentations of these groups is the fact that all presentations 
presently known use more than q relations of the form (4.35). 

5. Fixed rank 

In this section we will prove the following result, which implies Theorem A when 
the rank is bounded: 

Theorem 5.1. All perfect central extensions of finite simple groups of Lie type and 
given rank n, except perhaps the Ree groups 2 G2(q), have presentations of length 
0(log|G|) with 0(n 2 ) relations. 

In view of Theorem 4.36, we may assume that the n > 2. The idea in the proof is 
simple: replace the relations in the Curtis-Steinberg-Tits presentation (Sections 5.1 
and 5.2) involving rank 1 subgroups with the short bounded presentation in Theo- 
rem 4.36. 

5.1. Curtis-Steinberg-Tits presentation. Steinberg [Stl] gave a presentation 
for the groups of Lie type involving all roots groups corresponding to the root 
system. Curtis [Cur] and Tits [Ti2, Theorem 13.32] independently obtained similar 
presentations involving only rank 1 and 2 subsystems of the root system. It is the 
latter type of presentation that we will use, since less data is required. There are 
two slightly different versions of the presentation: the "high-level" one, due to Tits 
and based on buildings, that does not explicitly deal with root groups, and the 
more involved one that uses root groups and their commutator relations. For now 
we will state a version of the higher level one, explaining the more familiar one with 
formulas in the next section. 

Each group G of Lie type has a suitably defined root system $ and fundamental 
system II = {a\, . . . ,a n } of roots. We view the twisted groups of type 2 A m (q) 
(with m > 3 and m = 2n or m = 2n + 1), 2 D m (q) (with m > 4), 2 E 6 (q) or 3 D 4 (q) 
as having root systems of type C„, B m _i, F4 or G2, respectively. There is also a 
suitable sense in which 2 F^(q) has 16 roots [GLS, Corollary 2.4.6]. 

Theorem 5.2. (Curtis-Steinberg-Tits presentation [Cur, Stl, Ti2] and [GLS, The- 
orem 2.9.3]) Let G be a simple group of Lie type and rank at least 3. Let II be a 
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fundamental system of roots in the root system. Whenever a G II let L a be the cor- 
responding rank 1 subgroup of G; whenever a, (3 € II, (3 ^ ±a, let L Q ,/3 :— (L a ,Lp) 
be the corresponding rank 2 subgroup of G. Then the amalgamated product of the 
rank 2 subgroups L a ^, amalgamated along the rank 1 subgroups L a , is a central 
extension of G. 

Of course, the above version of the Curtis-Steinberg-Tits presentation also gives 
no information concerning presentations of rank 2 groups. For these we need to 
use commutator relations. The commutator version of the Curtis-Steinberg-Tits 
presentation is in [Cur], as well as [GLS, Theorem 2.9.3], and will be described in 
the next section. 

5.2. Commutator relations. Bounded rank groups were considered in [BGKLP]. 
We will use the method of that paper, inserting our presentations for rank 1 groups 
in place of the standard ones used there. In that paper, all commutator relations are 
correctly presented, as are the definitions of the toral elements hp. However, there 
were significant errors in that paper: for most of the twisted groups, the actions 
of hp on the root group elements x a (a) (or x a (a,b) for odd-dimensional unitary 
groups) were not correct. The methodology used was, however, correct, and will 
be used here without any explicit formulas. The same methodology was used in 
[KoLu], though not for all possible types of group and without considerations of 
lengths of presentations. 

We assume that G is a universal group of Lie type [GLS, p. 38]. We will factor 
out part or all of Z(G) later in this section; this point of view occurred in the 
Section 4.4, and will also be used in Section 6. We will ignore the tiny number 
of cases in which G has a perfect central extension by a group whose order is the 
characteristic [GLS, p. 313]. 

With each root a £ $ there is associated a rank 1 group L a = SL(2, q), PSL(2, q), 
SL(2,<7 2 ), PSL(2,<7 2 ), SL(2,g 3 ), SU(3,g) or Sz(q). As indicated above, we use the 
presentation P a for L a obtained in Section 4.4. Among our generators of L a we 
have the following: a torus (h a ); a p-element u a lying in a root group U a of L a 
and of G (note that U a = {u£"**) except when L a ~ SU(3, 2)); and a "reflection" 
r a (called "i" in Section 4.4) that is an involution modulo Z(L a ) such that r a 
normalizes (h a ) and U^ a = U- a is the opposite root group (relative to $). When q 
is tiny the generic presentations in Section 4.4 are not applicable, but there certainly 
are short, bounded presentations using at least the above generators. 

We have L- a = L a . Since G is universal, so is L a . Checking all cases, we find 
that 

(5.3) \h a \ = \U*\, where U* := U a \{l} for U a := U a /*(U a ) 

(here §(U a ) denotes the Frattini subgroup of U a ). 

There is a Borel subgroup B associated with the positive roots in $. It contains 
U a for all positive a and has a maximal torus H := (h a \ a £ II). Moreover, H 
normalizes each root group U a , and acts on U a as multiplications by elements in 
an extension field ¥ qC ( a ) of ¥ q , where |f7 a | = q e ^ a \ e(a) < 3, and 3 occurs only for 
3 D 4 (q). 

By Theorem 5.2 we need to consider all of the subgroups L aiyCCj , i ^ j, arising 
from our fundamental system II = {a\, . . . ,a n }. Here L auaj is a rank 2 group 
whose root system § ai , aj has the fundamental system {ai,aj}. Crucial for our 
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argument (and those in [BGKLP, KoLu]) is the fact that H has boundedly many 
orbits on the direct product of suitable pairs U a , Up. For this we need the following 

Lemma 5.4. Assume that L := L ai Ccj is not SU(5,2). Let a and (3 ^ ±a be in 
$a t ,a 3 - Ifu a € U a \®(U a ) and up £ Up\$(Up), then C a .p := C( haM ((u a ,up}) is 
in the center of the rank 2 subgroup X a ^p :— (U a , U- a , Up, U-p) of L. Moreover, 
either 

(a) C Qi/ 3 centralizes L, 

(b) q is odd and X ai p = SL(2, q) o SL(2, q) or SL(2, q) x SL(2, q) with center 
C a% p, inside Sp(4, q), SU(4,g) or G 2 (q), 

(c) q is odd and X a ^p = SL(2, q) o SL(2, q 3 ) with center C a ,p, inside z D±(q), 

(d) q is odd and X a p = SU(4, q) with center C a ,p, or 

(e) X ai p — SL(3, q) with 3\q — 1, inside L = G2(q) or 3 D 4 (g) . 

Proof. As already noted, U a — (ua la ' > ) since L is not SU(5, 2). Since H acts on 
L a as a group of automorphisms fixing U a and U- a , we then have Cn{u a ) = 
CniUa) = Ch{Lo). The latter equality can be seen by inspecting each of the rank 
1 groups L a ; but it is more easily understood by noting that Ch{Uo) normalizes 
each member of {U a } U (U- a ) Ua = {U a ) L ~. 

Consequently, C a .p centralizes the rank 2 subgroup X a ,p; we may assume that 
C at p does not centralize L. There are very few orbits of unordered distinct pairs 
{±a}, {±/3} under the action of the Weyl group of L on $ QiiQj . . When G does not 
have type ^(g) it straightforward to calculate X ai p in each case and then to check 
that it appears in (b)-(e). The case G = L = ^F±{q) cannot occur in view of the 
orders of the centralizers of semisimple elements obtained in [Shi] . □ 

We now describe the version of the Curtis-Stcinbcrg-Tits presentation using com- 
mutator relations [Cur] (cf. [GLS, Sec. 2.9]). For all roots a, (3 ^ ±a lying in the 
same rank 2 subsystem spanned by a pair of members of II we use the following 
two types of relations: 

(P a ) A presentation P Q of L a , including the elements u a , h a and r a de- 
scribed above. Let U a be as above. 
(B a ,p) [x a ,xp] — product of elements of root groups Ui a+ jp with ia+jfi e $, 
i, j > 0, for each nontrivial x a £ U a ,xp £ Up. 

Here and below, the terms of the product are assumed to be exactly the ones that 
occur in the given group G. Precise formulas for these products are given in [Stl, 
Sec. 10], [Gr] and [GLS, Theorem 2.4.5]. 

This gives the standard presentation for G. We will soon shorten this presenta- 
tion with the help of one further type of relations for the same pairs a, (3: 

(H at p) xlt = * Q for each nontrivial x a € U a , where * a € U a depends on both 
x a and hp. 

The crucial if simple observation is that, since (h a ,hp) normalizes U a and Up by 
(P a ), (Pp) and (H a ^p), it acts by conjugation on the set of relations {B a ^p). Hence 
we will use (some of) the relations {H aj p) to deduce all of the relations {B at p) 
from a small number of them. Specifying the latter subset is mostly a matter of 
bookkeeping. 

We break the proof of Theorem 5.1 into several cases. 
Rank 2 case: Define z — z{G) as follows: 
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G= SL(3,g) Sp(4,g) G 2 (q) SU(4,g) SU(5,g) 3 D 4 (q) 2 F 4 (g) 
z=(3,q-l) (2,q-l) (3,g-l) (4,g+l) (20,g+l) (3,g-l) 1 
so that z < 20. 

Recall that H — (h ai ,h a2 ) acts on each U a as a group of field multiplications in 
an extension field F* e(a) . Let x afl £ U a , 1 < [i < z, denote elements corresponding 
to coset representatives in F* e(a) of the zth powers of all elements of F* e(a) (or 
projecting onto such coset representatives in U a for nonabelian U a in the SU(5, g) 
and 2 F 4 {q) cases). 

We now replace the preceding presentation by the following one for a group J 
(using the same pairs a, (3 as above). 

(P a ) As before. This also provides us with the various x a/J/ , which we may 
assume are among the generators of L a . 
(B' a ,p) [ x an,xpv\ — product of elements of root groups Ui a+ jp with ia + j(3 £ 
<&, i,j > 0, for all /i, v. 

{H' a p) Xafi — * a for all /i, where * a € U a depends on both x afl and hp. 

We will show that J = G. Since G is a homomorphic image of J, for each root 
a we have a subgroup L a behaving as in (5.3). 

Once again, U a = (x^ La ' > } if x a G U a \Q(U a ). The action of H on each U a as a 
group of field multiplications is encoded in (P a ), (Pp) and (H' a ). 

For some perpendicular pairs a, (3 the relations (B' a g ) state that [x alJ/ ,xp v ] = 1. 
For such a pair a, (3 we only need one such relation with x afl G U a \<f>(U a ) and 
xp v e Up\$(Up), since conjugating by all elements of (h a ,hp) then implies that 
[U a , Up] — 1, and hence that all relations (B a ,p) hold. 

We deal with (B' a o) for all remaining pairs a, (3 in a somewhat similar man- 
ner. Lemma 5.4 implies that |C ail g| divides z. By (5.3), each orbit of (h a ,hp) on 
Ua x &f has size \U# x Up\/\C at p\. Hence, our pairs (x afl ,xp u ) include represen- 
tatives for all (h a , ft^-orbits on Uf x 17*. 

Conjugating the relations (B' a p) by all elements of H, and using (H' a p), we 
obtain all relations (B a ^) if U a and Up are abelian. When U a or Up is nonabelian, 
for each pair (x a ^(U a ), xp<&{Up)) of cosets wc still obtain a relation of the form 
(Ba,p)- By using the elementary identity [i, ra] = [x, v] [x, u] v in both G and the 
presented group J, we see that our conjugates of the relations (B' a p) imply all 
relations (B a jj). 

Thus, all relations (P a ) and (B a p) required in the Curtis-Steinberg-Tits presen- 
tation hold for J, so that J = G. 

There arc at most 20 2 relations (B' a /3 ) and 20 relations (H' a /3 ) for each choice 
of the roots a, (3. The length of this presentation is O(logg): each factor appearing 
in (B' a g), and each * Q , has that length by Theorem 4.36(b). 

General case (excluding SU(2n + 1,2)): Let z(G) be the least common multi- 
ple of the integers z{L ai ^ aj ) for the various rank 2 groups L au0tj just considered. 
Introduce elements x a ^ E U a , 1 < /i < z(G), that behave as before. 

Theorem 5.2 uses all L ai , aj , including ones of type A\ x A\ occurring when the 
roots cti and aj are perpendicular. Therefore we need to consider the corresponding 
relations (B' a /3 ) and (H' a (3 ) for a and (3 in the root system spanned by cxi and ay. 
As above, in the A\ x A\ case one relation (B' a ) with x afl <E U a \&(U a ) and 
xp v e Up\$(Up) implies all relations (B at p). 
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As before we see that all (P a ), (B' a (3 ) and (H' a /3 ) imply all relations {B a ,p). 
Once again, J = G. 

The case SU(2n + 1, 2): For each long root a lying in the span of two fundamental 
roots, we introduce two elements x afl that generate the quaternion group U a . We 
can now proceed exactly as above. 

Summary: Each relation (B' a p) involves a bounded number of elements of various 
rank 1 groups L 7 ; each of these elements has length (^(logq) in the generators of L 1 
by Theorem 4.36(b). A similar statement holds for (H' a p). We needed to consider 
0{n 2 ) pairs a, (3. 

Centers: While this is a presentation for the universal group G, the center can be 
killed exactly as in [BGKLP, Sec. 5.2]. For example, for groups of type A n there is 
a standard product of the form n™ 1 ^ (£'), \£\ = (? — 1, n + 1), that generates the 
center. By Theorem 4.36(b), this expression has length 0(n log q) in our generators. 
Thus, within our length requirements we can factor out all or part of the resulting 
cyclic group (a very different approach to this is used later in Section 6). For the 
remaining types it is also easy to write the required central elements as short words 
in our generators. 

We have now handled all cases of Theorem 5.1. □ 

Note that this presentation often involves more elements x afl and relations (B' a p) 
than are actually needed. For example, for odd-dimensional unitary groups we never 
need 20 coset representatives: we only need at most 4 or 5 for any pair a, (3 of roots 
(cf. Lemma 5.4). Different examples occur in [GKKL2]. 

Remark 5.5. For future reference we mention two elements of SL(n — l,q) that 
can be readily found using the above generators. By Theorem 4.36(b), 

^100 \ /0 10 \ 2 l-\ 0^ 
001O -1 O _ 001O 

0-10 001 "010 

V o ij\o 1 1 



C23 := 



V o 1) 



is an element of L a2 L ai , and hence has length 3 in our generators. In the monomial 
action on the standard basis e\, . . . , e„, we have C23 = (ei, — ei)(e 2 , 63). Hence, 

C2345 := (ei, -ei)(e 2 ,e 3 )-(ei, -ei)(e 3 ,e 4 )-(ei, -ei)(e 4 ,e 5 ) = (ei, -ei)(e 2 , e 3 , e 4 , e 5 ) 

has length 9. Compare Proposition 5.6. 

5.3. Word lengths. The preceding remark is a very special case of the following 
observation: 

Proposition 5.6. If n is bounded in Theorem 5.1, then every element of every 
central extension of G is a word of length O(logg) in the generators used in the 
theorem. 

Proof. The elements r ai generate the Weyl group W modulo H. By hypothesis, 
|$| and \W\ are bounded. By the Bruhat decomposition, G = {J weW BwB with 
B = UH a Borel subgroup. Here U and H are products of fewer than |$| subgroups 
of L a , a <G Since each L a is a VF-conjugate of some L ai , Theorem 4.36(b) implies 
the result. □ 
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6. Theorem A 

We now turn to Theorem A. By Theorem 5.1, we only need to consider the 
classical groups of rank greater than 8. 

6.1. A presentation for SL(n, q). We begin with the case SL(n, q). We would 
like to use the Weyl group S n in our presentation, but SL(n, q) does not have a 
natural subgroup S n when q is odd. There are various ways around this difficulty, 
such as using a subgroup (2 n xiS n ) n SL(n,q) or the alternating group (compare 
[GKKL2]). We have chosen to use S n -i as an adequate substitute for S n . 

Theorem 6.1. All groups SL(n,q)/Z, where Z < Z(SL(n,q)), have bounded pre- 
sentations of length 0(\ogn + logg). 

Clearly the most interesting cases are Z = 1 or Z(SL(n, q)). However, later we 
will also need the case \Z\ = 2. 

Proof. We start with two bounded presentations of length 0(log n + log q) given in 
Theorems 3.17 and 5.1: 

• T = (X | R(X)) of Sn-! (acting on {1,2,..., rc}, fixing 1), and 

• F= (X | R) of SL(5,g). 

The notation R(X) is used here since later we will need to use a second 
copy (X | R(X)) of S n . 

We assume that X and X are disjoint, and that these presentations 
satisfy the following additional conditions for some X\, X 2 C X and Y C X: 

(i) (X\ U X 2 ) projects onto the stabilizer T 2 of 2. 

(ii) X\ projects into A n -i, and each element of X 2 projects outside A n _i. 

(iii) a;(2,3)) £(2,3,4,5), ^(6,7), £(5+«,...,n) are words of length O(logn) in X 
that project onto (2,3), (2,3,4,5), (6, 7), (5 + S, . . . , n), respectively, 
where S = (2, n — 1). 

(iv) x a is a word of length O(logn) in X and projects onto a — (2, ... ,n) 
(this is needed only for handling the center of SL(n, q)). 

(v) The elements u,t 7 h^ in (3.2) and (4.18), that generate a subgroup 
L = SL(2,gr), are in X. When q is odd, we also assume that I12 € X. 

(Here we are using the subgroup ^ ^ 01 SL(5, q).) 

(vi) C23 and C2345 are words of length 0(log q) in X acting as in Remark 5.5. 

(vii) (Y) = SL(4, q), \Y\ = 2 and the members of Y have length O(logg) in 

X. (Here we are using the subgroup ^Jj SL(4 q^j °' ? ^L(5,g).) 

(viii) X contains a set of generators of the stabilizer T 2 3 = T 2 n T 3 of both 
2 and 3. (This will only be needed later when we deal with some 
orthogonal groups in Section 6.2, Case 2.) 

Existence: In Remark 3.22(i) we constructed the permutations in (iii) and (iv); in 
Remark 3.22(ii, iii) we noted that conditions (i) and (ii) hold; and Remark 3.22(iii) 
also takes care of (viii). Remark 5.5 finds the elements in (vi). Note that the group 
L = L ai , and a presentation for it using the generators in (v) among others, were 
essential ingredients in Section 5.2 when q > 9. For (vii) see Proposition 5.6. 

Now our presentation is as follows: 
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Generators: X U X (we are thinking of T and F as embedded in SL(n,q) as 

{% P e— ) — — — - (T ?)• ~P~ 

tively). 

Relations: 

(1) R(X)UR. 

(2) u Xl = m, S 11 = u, for all £i e Xl (where we have abbreviated u :— u l ). 

(3) u l! = u- 1 , u* 2 = u"\ for all x 2 e X 2 . 

(4) [fe ( ,IiUl 2 ] = l. 

(5) £(2,3) = C 2 3, £(2,3,4,5) = C 23 45- 

We will show that the group G defined by this presentation is isomorphic to 
SL(n,q). There is a natural surjection tt: G — > SL(n, q). (For, in view of (ii), 
relation (3) express the fact that each odd permutation maps the first basis vector 
to its negative, while the even permutations in (2) fix that vector, as do the elements 
off.) 

By (1) and Lemma 2.3, G has subgroups we can identify with T = (X) and 
F=(X). 

We have (u, u, h^, £(2,3,4,5)) = F since (u,u,h^) is L = SL(2,g). By (i), (2), (3) 
and (4), X\ U X 2 acts on {u ±1 ,u ±1 , h^}, so that the elements £(6,7) and £(5+5,. ... n ) 
of T 2 = {Xi U X 2 ) act on F. Moreover, (2(2,3), £(2,3,4,5)) = (023,02345) < F by (5). 
Thus, 

|^t(^)| > I (^(2,3), £(2,3,4,5) ,£(6,7), £(5+5,. ..,n)) I = |5*4 X S n - 5 \, 

while |F T | > |tt(F t )| = ( n ~ 1 ). It follows that T acts on f T as it docs on the set 
of all 4-sets in {2, ... , n}. 

The subgroups U12 = (u^^) and U21 — (u^ h ^) of F are root groups order q. 
Once again, by (i), (2), (3) and (4), T 2 = (X\ U X 2 ) normalizes each of them. As 
above, it follows that T acts on both {Ui 2 ) T and {U 2 \) T as it does on {2, ... , n}. 

Both C/31 = (?/2i) C23 and C/23 = [C^i2,C^3i] are also root groups of F. Since 
(T 2 ) C23 = T 3 normalizes (U 2 i) C23 = U 31 , it follows that T 2 n T 3 normalizes J7 23 . As 
above, we find that |(C/23) T | = (n — l)(n — 2). 

Thus, T acts on both (?7i2) T and (U 2 i) T as it does on singletons, and on {U 2 z) T 
as it does on ordered pairs from our (n — l)-set. By the 4-transitivity of T, any 
given pair from {U\ 2 ) T U {U 2 i) T U (J7 2 3) T can be conjugated into F by a single 
element of T. 

Consequently, N :— (L T ) = SL(n,q) by the Curtis-Steinberg-Tits presentation 
(as in Section 5.2 or Lemma 2.10). Moreover, N < G, and G/N is a homomorphic 
image of (T) = 5„_i in which, by (5), a transposition is sent to 1. Thus, G/N = 1. 

This proves the theorem when Z = 1. 

Now that we have a presentation for SL(n, g), we need to factor out an arbitrary 
subgroup Z of its center. However, a generator of Z probably cannot be written as 
a short word in our present generators. In order to deal with this obstacle we will 
use the following lemma to increase our generating set: 

Lemma 6.2. Suppose that G is a finite group containing the wreath product 
H l S m — S m k H m for some group H. Let (Y \ S) be a presentation for G of 
length < I and (Y \ S) a presentation for S m of length < I. Fix e € H and let 
hi j be the element in H m with coordinates 1 everywhere aside from ith coordinate 
e and jth coordinate e _1 . Assume that 
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(a) The following elements of the standard subgroup S m < S m K H m can be 
written as words of length < I inY: a = (1,2, ... , to), [i — (2, . . . , m) and 
r 2 = (2,3); 

(b) The elements a and t 2 in (Y \ S) corresponding to the above ones can be 
written as words of length <linY; and 

(c) /ii,2 can be written as a word of length < I inY . 
Then the following is a presentation for G, of length < 13/: 

Generators: YuYu{d}. 
Relations: 

(1) sus. 

(2) a = ad. 

(3) f 2 = T 2 h 2 ,3- 

(4) [d,T 2 ] = [d,»] = [d,h 2 , 3 ] = l. 

Moreover, d maps to the element (e 1_m , e, . . . , e) G H m < G. 

Note that Y and Y are unrelated. It is the subgroups (er, t) and (&,f), both 
isomorphic to S m , that are related via the isomorphism "bar" ; and we will see that 
they are conjugate in S m k H m . Also note that e is used to define the element h 2y3 
and hence is essential for (3) and (4). 

Proof. If 7r e S m and (oi, . . . , a m ) € H, then in S m k H m we have 
(oi, . . . , a m )ir = 7r(a 7r(1 ), . . . , a 7r(m) ). 

In particular, (ai, . . . , a m ) CT = (a m , ai, ■ • ■ , « m -i) is "pushing to the right". 

Let J be the group presented above. It is straightforward to check that J sur- 
jects onto G, by taking d to be (e 1_m , e, . . . , e) and letting the new copy of S m be 
conjugate to the standard copy via (e^ 1 , e m ~ 2 , . . . , e 2 , e, 1) G _ff m . It is also easy to 
check that this presentation has length < 13Z. 

By Lemma 2.3, J has subgroups we can identify with G = (Y) and S m = (Y). 
It suffices to prove that J < G. 

Relations (4) imply that d commutes with = (k,k + 1) and hk,k+i whenever 
2 < k < to. Hence, by (2) and (3), 

r 3 = (r 2 r" = (T 2 ^ 3 ) d " ff " - {rf'h^Y = [t^Y' 1 = r 3 h 3A . 

Similarly, induction gives fk = Tkhk,k+i whenever 2 < k < to. A similar equation 
also holds for r m := (to, 1): 

I'm (^m— 1) _ — ij~m— \h"m— l,m) (^m— l^m— l,m) — ^~mhm,l- 

Consequently, f 2 , . . . ,f m e 5 TO k i? m < G. It follows that our second copy (Y) of 
5 m lies in G, and hence so do a, Y and d (by (2)). Thus, J = (Y U Y U {d}) < G, 
as required. 

In order to explain the motivation used here, we include the following direct 
calculation that d behaves as desired: 

<y = t 2 t 3 ■ ■ ■ f m -if m 

= T2^2,37"3^3,4 ' ' ' T m _l/l TO _l iTO T m /l TO> l 
= T 2 T 3 • • • T TO _iT TO /l2,l/i3,l ' ' ' h m -l,lh m ,l 

d = a 1 a = /i2,i/i3,i • • • h m -i t ih m ,\. □ 



PRESENTATIONS OF FINITE SIMPLE GROUPS 



43 



When we use this lemma, the bound I will be taken to be 0(logn + \ogq). 

Completion of the proof of Theorem 6.1: Let Z = (el). In the preceding lemma let 
tyl = n — 1 and let H = (e) be the subgroup of F* of order |e|. Embed H 71 ^ 1 into 
G = SL(n, q) as all diag(ei, . . . , e n ) with ei € (e) and LJ e, = 1. We already used a 
presentation (X \ R(X)) for S n -i above in (i), where S n -i permutes the last n—1 
coordinates. We use the corresponding presentation (X | R(X)) for a second copy 
of S n -i (with X and X disjoint). 

First we need to verify conditions (a)-(c) of the lemma. We temporarily use 
the notation in Remark 3.22(i): a = (2, 3, ...,n) and z = (2,3) have the re- 
quired length, hence so do \i = (2,3)<r and (3,4) = (2,3) CT . It follows that 
Lemma 6.2(a) holds, and hence so does (b). It remains to consider (c). The group 
L = SL(2,g) was defined above in (v). Use Theorem 4.36(b) to write the ele- 
ment d! = diag(e, e _1 , 1, . . . , 1) S L as a word of length 0(\ogq) in our generators 
(cf. (v)). Then /i 2 ,3 = d! a has the length required in (c). 

The lemma provides a new bounded presentation for G of length (9(log n + log q), 
including a new generator d representing the diagonal matrix diag(l, e 2 ~ n , e, . . . , e). 
Then Z = (dd 1 ), so that the additional relation dd' = 1 produces the desired factor 
group. □ 

6.2. Generic case. In this section we obtain short bounded presentations for the 
universal central extensions of the simple groups of Lie type. This is significantly 
simpler than dealing with the simple groups, which involves factoring out the centers 
of the universal extensions. However, in some cases the latter is easy: it is a matter 
of choosing slight variations on the subgroups we amalgamate. 

We begin with some general properties of these universal central extensions: 

Lemma 6.3. Let G be a simple classical group defined on a vector space of dimen- 
sion > 8, and let G denote its universal central extension. 

(1) If G = PSL(n, q) then G = SL(n, q) . 

(2) If G = PSU(n, q) then G = SU(n, q). 

(3) If G = PSp(2n, q) then G = Sp(2n, q). 

(4) IfG = fl(2n+l,q) then \Z(G)\ = (2,q-l). 

(5) IfG = Pn + (2n,q) then \Z(G)\ = (4,q n - 1), and Z(G) is cyclic unless n 
is even. 

(6) IfG = PfT(2n,g) then Z(G) is cyclic of order (4, q n + 1). 

(7) Suppose that G = Cl(V) is an orthogonal group over a field of odd charac- 
teristic. Then Z(G) has an involution, the spin involution, lying in Z(H) 
whenever H = Q(U) for a nondegenerate subspace U of V of dimension at 
least 3. 

Proof. See [GLS, pp. 312-313] for (l)-(6) and [GLS, Proposition 6.2.1(b)] for (7). □ 
Note that there are various exceptions to (l)-(6) when the dimension is at most 
8 [GLS, p. 313]. 

We will use the following crucial theorem as well as variations on its proof. 

Theorem 6.4. All universal central extensions of groups of Lie type of rank n over 
F q have bounded presentations of length 0(logn + log 5). 

Proof. By Theorem 5.1, we may assume that G is a classical simple group of rank 
n > 8. We will use the root system of G and the Curtis-Steinberg-Tits presentation 
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(Theorem 5.2). For each root a there is a corresponding subgroup L a that is 
a central extension of PSL(2, q), PSL(2, q 2 ) or PSU(3, q). We may assume that 
II = {ai, . . . , a n } is a fundamental system of roots (in the standard order), where 
ai, . . . , a n -i have the same length. 

Case 1: G does not have type D n . Let 

111 = {ai, • • • , a n -i}, n 2 = {a\, . . . , a n - 2 , a n } and n 3 = {a n _i,a n }. 

1 2 n — 2 n— 1 n 
• • • • • • » • 

Remark 6.5. These sets of roots have the following properties: 

(a) For i = 1,2,3, either | H» J = 2 or G, := (Z^ | /? e IF) is of type SF d , 
SLd x SL 2 or SLd x SU3 for some d; and 

(b) Each pair from II lies in some IF. 

Namely, (b) is clear, and G\ is a homomorphic image of SL(n, q) or SL(n, g 2 ) (as 
is seen from the commutator relations in Section 5 - or more precisely in the case 
of odd-dimensional unitary groups, from the explicit relations in [Gr, BGKLP]). 
Moreover, G2 has type SL„_i x SL 2 or SL„_i x SU3, while |n 3 | = 2, which proves 
the remark. 

The root aj (1 < i < n) can be identified with the ordered pair + 1) of 
elements of the set {1, . . . ,n} on which the Weyl group S n of G\ acts. The cor- 
responding root groups X± ai arc just groups of elementary matrices of G\, and 
generate a subgroup L ai = L^i+i = SL(2, q) or SL(2, q 2 ) acting on the span of two 
of the standard basis vectors of the usual n-dimcnsional module for G\ . 

We use the following groups and presentations. 

• G\ is the universal central extension obtained in Theorem 6.1 (but reversing 
the order of the set {1, . . . , n}), given with a bounded presentation (X\ \ Ri) 
of length O (log n + \ogq) including 

(i) generators inside X\ for T = S n -\ acting on {1, . . . ,n} and fixing n, 
and for T^_ 1 , the stabilizer of n — 1 in T' = A„_i; 

(ii) a bounded presentation (X(n — 1) | R(n — 1)) of length 0(logq) for 
the subgroup L a<n _ 1 ] and 

(iii) a set X(n — 2) of generators for the subgroup L an _ 2 of G\. 

Existence: Note that Remark 3.22 (iii) provides short generators for 
T^_ 1 , as required in (i). The groups L an _ 1 and L an _ 2 were used in the 
presentation for F in the proofs of Theorems 5.1 and 6.1, so that we also 
already have X(n — 2) U X(n — 1) C X\ and R(n — 1) C as required in 
(ii) and (iii). 

Then (L an _ 2 ,T^_ 1 ) ^ SL(n - l,q) or SL(n - 1, q 2 ), since T' n _ x fixes 
the corresponding standard basis vector (whereas each odd permutation in 
T n _i sends that vector to its negative; cf. Section 6.1). 

• G3 = (X3 I i?3) was obtained in Section 5 by using the root system II3 and 
bounded presentations of length 0(log q) for two rank 1 groups, namely 

(iv) the same presentation (X(n — 1) | R(n — 1)) used above for L ctn _ 1 ; 
and 

(v) a bounded presentation (X(n) \ R(n)) of length 0(\ogq) for L an . 
Hence, we already have X(n — 1) U X(n) C X 3 and R(n — 1) U R(n) C i? 3 . 
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Our presentation is as follows: 

Generators: X\ UI3. 
Relations: 

(1) R!UR 3 . 

(2) Identify L a _ r inside Gi and inside G3 using the identity map. 

(3) [L an _ 2 ,L an ] = [T^_ 1 ,L an ] = l. 

Then G 2 := (ia n _ 2 ,7^_i) ■ L an = (Lp \ e IT n H 2 or U 2 n II3) is a central 
product of an SL„_i and an SL2 or SU3. In view of Remark 6.5(b), we obtain the 
desired universal central extension G by Theorem 5.2. This presentation is clearly 
short and bounded. 

Case 2: G has type D n . We assume that a n _i and a n are connected to a„_2 in 
the Dynkin diagram. This time we use 

III = {ai, . . . , a„_i}, n 2 = {ai, . . . , a„_ 3 , a„} and II3 = {a„_2, a„_i,a n }. 

Once again Remark 6.5 holds, this time with Gi,G 2 and G3 the universal central 
extensions SL(n, q), SL(n — 2, g) x SL(2, g) and SL(4, q), respectively. This time we 
use the following presentations. 

• Gi is the universal central extension obtained in Theorem 6.1 (but once 
again reversing the order of the set {1, . . . ,n}), given with a bounded pre- 
sentation (Xi I Ri) of length (9(logn + logg) including 

(i) a bounded presentation (X(n — 1) | R(n — 1)) of length Oilogq) for 

(ii) a bounded presentation (X(n — 2) | i?(n — 2)) of length O(logg) for 

(iii) a bounded presentation (X(n — 3) | R(n — 3)) of length 0(logq) for 
L an 3 ; and 

(iv) generators inside Xl for T = S n -\ acting on {1, . . . , n} and fixing n, 
and for the stabilizer T^ L _ l n _ 2 of both n — 1 and n — 2 in X" = A n . 

The groups L = L ctn _ 1 , L an _ 2 , L an 3 were used in the presentation for F 
in the proof of Theorem 6.1, so that if^z\X{i) C Xi and U^Igi?^) C R 1 . 
Generators for T' n _ x rl _ 2 were obtained in Remark 3.22(iii). (Recall that 
the numbering in that remark has been reversed in this proof.) This time 
(ia B _s,r;_ lin _ 2 )SSL(n-2,g). 

• (X3 I R 3 ) is a bounded presentation of length O(logg) for G3 = SL(4, q), 
obtained as in Section 5 using the presentations (i) and (ii), together with 
the root system IT3 and 

(v) a bounded presentation (X(n) | R(n)) of length 0(\ogq) for L an . 
Note that U™_ 2 X(i) C X 3 and \J\\_ 2 R{i) C R 3 . 

Our presentation is as follows: 

Generators: li U I3. 
Relations: 

(1) fliUtfs. 

(2) Identify (L an _ 2 , L an _ 1 ) = SL(3, q) inside Gi and inside G 3 using the 
identity map. 

(3) [-^a n _3) ^a„] = K-l,n-2i -^aj = 1- 
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Once again G 2 := (L an _ 3 ,T^_ l n _ 2 ) ■ L an = (Lp \ [3 e II i n U 2 or Yl 2 n II3) 
behaves as required. In view of Remark 6.5(b), we obtain the desired universal 
central extension G by Theorem 5.2. □ 

6.3. Symplectic groups. We will need to factor out the centers of the various 
groups in Theorem 6.4. The following simple observation will make this easy in 
many cases: 

Lemma 6.6. Suppose that q is odd. 

(i) If n is even then SL(n, q) has a bounded presentation of length 0(\ogn + 
logq) in which the involution z in the center o/SL(n, q) is a word of length 
0(\ogn + log in the generators. 

(ii) Ifn is odd then SL(n, q) has a bounded presentation of length 0(log n+logq) 
in which the central involution z in any given Levi subgroup SL(n — 1, q) is 
a word of length 0(logn + log 9) in the generators. 

Proof. Let G = SL(n,g). 

(i) In the completion of the proof of Theorem 6.1, we found a presentation for 
G in which z = dd' behaves as stated when e is chosen to be —1. 

(ii) Here we proceed exactly as in the proof of Theorem 6.4 for the present group 
G instead of the other classical groups G considered in that theorem, using the 
presentation for G\ = SL(n — 1, q) obtained in (i). □ 

Note that (ii) also can be proved using Lemma 6.2. 

Corollary 6.7. PSp(2n, q) has a bounded presentation of length (9(logn + logg). 

Proof. The proof of Theorem 6.4 for G = Sp(2n, q) involved a presentation for 
G\ = SL(n, q). This time we will use the presentation for SL(n, q) in the preceding 
lemma. 

If n is even then the additional relation z = 1 in Lemma 6.6 (i) produces the 
desired presentation. 

Suppose that n is odd. Let ei, . . . , e n , fi, . . . , f n be a hyperbolic basis of the 
F g -space underlying the group G. Choose this basis so that L ai has support 
(ei, e i+ i,fi, fi+i) for i < n and (e n , f n ) for i = n. Let H = (X± ai | i < n - 2) = 
SL(n— 1, q), with support the orthogonal complement (ei, . . . , e„_i, /1, . . . , f n -i) of 
the support (e„, /„) of L an = Sp(2,g). By Lemma 6.6(h), the involution z € Z(H) 
is a word of length 0(\ogn+\ogq) in our generators. On the other hand, the central 
involution z' of L an has length 0(\ogq) by Theorem 4.36(b). Hence, the additional 
relation z = z' produces the desired presentation. □ 

6.4. Orthogonal groups. 

Theorem 6.8. All perfect central extensions of simple orthogonal groups of dimen- 
sion N over ¥ q have bounded presentations of length 0(logN + logq). 

Proof. We may assume that N > 8 and the center of G is nontrivial. By Lemma 6.3, 
q is odd and there are various quotient groups to consider. 

6.4.1. Factoring out the spin involution. For each simple orthogonal group G we 
need to factor out (s) from G, where s is the spin involution in Lemma 6.3(7). 

Assume that G does not have type D n . By Lemma 6.3(7), s is the central 
involution in L an = SL(2,g) or SL(2,g 2 ) (where the central quotient groups are 
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the orthogonal groups Pf2(3, q) or PQ~(4, q)). By Theorem 4.36(b), s has length 
0(\ogq) in our generators. Thus, the additional relation s = 1 produces the desired 
presentation. 

If G has type D n we use (L an _ 1 , L an ) = SL(2, q) x SL(2,g). This time s is 
the central involution lying in neither SL(2, q) factor. Once again s has length 
O(logg) in our generators, and the additional relation s — 1 produces the desired 
presentation. 

By Lemma 6.3, we have now provided a short bounded presentation for Gj (s) 
in all cases; this proves Theorem 6.8 when G is f2(2n + 1, q), Pf2~(2m, q) with m 
even, or Pfl e (2m,q) with m odd and q = —el (mod 4), where e = ±. 

6.4.2. D n {q) with n even. Here we merely repeat the argument in Case 2 of the 
proof of Theorem 6.4, this time using the presentation for G\ = SL(n, q) provided 
by Lemma 6.6(i). The additional relation z = 1 produces the desired presentation. 

At this point we have a presentation for G = P£! + (2n, q), as well as for its 
universal cover G. The center of G is elementary abelian of order 4. Starting with 
G, use Proposition 2.4 twice (with p = 2) in order to obtain a presentation for G in 
which we have words of length 0(logn + log q) for the generators a, b of Z{G). Then 
each group Gj (a), Gj (6), Gj (a, b) can be obtained by adding one or two relations 
of length (9(log n + log q) . (Of course, one of these groups is Gj (s) , which was dealt 
with in Section 6.4.1.) 

Note that earlier we passed from the universal central extension of a simple 
group to quotients of it. The above argument went in the opposite direction: from 
a simple group to all perfect central extensions, in the case where the center of 
the universal central extension was bounded. This same idea already occurred in 
Corollary 3.23. 

6.4.3. Factoring out —1. It remains to consider the orthogonal group YQ e (2m,q) 
where m is odd and q = el (mod 4). 

It is convenient to rename G as G/(s) = Q. e {2m,q). We will obtain a new 
presentation for G in order to factor out (—1). 

Case 1: G = _ (2m, q) with m and q odd. Here the rank is n = m — 1, which is 
even. Use Lemma 6.6(i) to obtain a bounded presentation of length O(logn + logg) 
for G\ = SL(n, q) such that its central involution z has length (9(logn + \ogq) in 
the generators. 

Use this presentation for G\ in Case 1 of the proof of Theorem 6.4 in order 
to obtain a presentation for G of length 0(logn + \ogq). Pass modulo the spin 
involution as in Section 6.4.1 in order to obtain a presentation for G. We still need 
a short bounded presentation for Gj (— 1). 

Let ei, . . . , e m , /i, . . . , f m be a basis of the F 9 -spacc underlying G, where ei, . . . , 
e„, /i, ...,/„ is a hyperbolic basis of its span and is perpendicular to the anisotropic 
2-space (e m ,/ TO ). We can choose these bases so that L Qi has support (ej,ej + i,/j,/j + i) 
for each i < n. Then G\ = (L ai , . . . , L CCn _ 1 ) = SL(n, q) has support (ei, . . . , e n , 
/i, . . . , /„), and z is —1 on this subspace. Note that (ei, . . . , e n , fi, . . . , f n ) 1 ' = 
(e m ,/ m ) has type £l~(2,g), and -1 E GL((e m ,/ m )) lies in fi _ (2,g) since q = 3 
(mod 4). 

The group (I a „_n L an ) = Vl (6, q) has an element z' with support (e m , f m ) and 
acting as —1 on this 2-space. Use Proposition 5.6 to write z' as a word of length 
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0(logq) in our generators. Now add the relation z = z' in order to obtain the 
desired presentation for Gj (—1) = PQ~(2m, q). 

Case 2: G = £l + (2n, g) with n and q odd. Here the rank is n. Let ei,..., 
e„, /i, . . . , /„ be a hyperbolic basis of V such L a> has support (ej, e i+ i, /», for 
each i < n. Let Gi = SL(n, g) < G preserve each of the subspaces (ei, . . . , e n ) 
and . . . , /„). Let i? = SL(n - 1, g) preserve (ei, . . . , e„_i) and . . . , / n -i), 
inducing 1 on (ei, . . . , e„_i, /i, . . . , /n-i) 1 - = (e„, /„). 

Use this subgroup i? in Lemma 6.6(h) in order to obtain a presentation for 
Gi such that the central involution z of H has length 0(logn + logg). Use this 
presentation for G\ in Case 2 of the proof of Theorem 6.4 in order to obtain a 
presentation for G of length 0(logn + logg). Pass modulo the spin involution as 
in Section 6.4.1 in order to obtain a presentation for G. 

The element —1 e GL((e„, /„)) lies in f2 + (2,g) since g = 1 (mod 4). This time 
(I tt „_D L an ) = Q+(4, g) has an element z' with support (e„,/„) and acting as — 1 
on this 2-space. Use Theorem 4.36(b) to write z' as a word of length 0(log g) in our 
generators. Now add the relation z — z' in order to obtain the desired presentation 
for G/(-l) =Pfi + (2n,g). □ 



6.5. Unitary groups. We handle unitary groups as in Section 6.2, once again 
choosing a presentation more carefully. 

Theorem 6.9. All groups SU(N,q)/Z, where Z < Z(SU(N,q)), have bounded 
presentations of length 0(log N + log q) . 

Proof. As above we rename G as SU(iV, g), and let V be the underlying ¥ q 2 -space. 
Let n — [N/2], and let the vectors e\, . . . , e n , f\, . . . , f n be a hyperbolic basis for 
the subspace they span; moreover, let these be perpendicular to a final basis vector 
v if N = dim V is odd. We will focus on the case where N = 2n + 1 is odd, the 
even case possibly being slightly easier. We write matrices with our basis ordered 
6i , • • • , e n , /i , . . . , f n , v. 

We can choose the basis so that L ai has support (ej, ej + i, fi, /i+i) for 1 < i < 
n— 1, and (e„, /„, v) when i = n. Then the support of G\ = SL(n, q 2 ) is (ei, . . . , e n , 
/i, . . . ,/„), while the support of G 3 = SU(5, q) is (e n _i,e n , /„_i, f n ,v). 

In Case 1 of the proof of Theorem 6.4 we used the presentation for G\ appearing 
in Theorem 6.1. This time we use the presentation for G\ deduced from it via 
Lemma 6.2, using m = n - 1, Z = (el) and H = (e) < F* 2 . Here H rn = H 11 - 1 
is embedded in SL(n, q) as the diagonal matrices diag(ei, . . . , e n , e\, . . . , e n , 1) with 
€i G H and f\ i ei = 1 (note that e _1 = e: this matrix is, indeed, an isometry); while 
S m — S n -i is embedded as all matrices 



o 



V 



o 

±i 

o 

o 



o 





±1 



o 



using permutation matrices 7r of sign ±1. Lemma 6.2 produces a new generator 
d in a slightly modified short bounded presentation for Gi; the matrix for d is 



diag(e, . . . ,e,e 2 l,e, .. 
used in that lemma). 



, e,e 2 ",1,1) (where we have reordered the coordinates 
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Since e G Z(G) we have e 2n+1 = 1, and there is an element d! € G3 with matrix 

diag(l,...,l,e"-\e,l,...,l,e"-\e,e) 

such that dd' = el. Express d' as a word of length O(logg) in the generators for 
G3 by using Proposition 5.6. The additional relation dd' = 1 produces the desired 
presentation for the quotient group Gj (el) = G/Z. □ 

6.6. Perfect central extensions. The Curtis-Steinberg-Tits presentation pro- 
duces the universal central extension of a group of Lie type. Once we obtained 
a version of this presentation, we had to factor out all or part of the center. In the 
process we proved the following additional 

Theorem 6.10. Each perfect central extension of a finite simple group G of rank 
n over W q has a bounded presentation of length 0(\ogn + logq). 

Proof. In the preceding parts of Section 6 we proved this theorem for the various 
classical groups. We dealt with the alternating groups in Corollary 3.23. □ 

7. Theorems B and B' 

In this section, we prove Theorems B and B' (Holt's Conjecture for simple 
groups). We recall the following well-known observation (cf. [Ho, Lemma 1.1]): 

Lemma 7.1. If a finite group G has a presentation F j R with F free and R normally 
generated by r elements, then d\m^ p H 2 (G,M) < rdhnw p M for any ¥ p G-module 
M. 

Proof. A short exact sequence 

1 -> M -» E -» F/R -> 1 

is determined (up to equivalence) by a G-homomorphism R — > M. Since R can be 
generated by r elements (as a normal subgroup) , any homomorphism is determined 
by the images of the r generators. Thus, there are most \M\ r such homomor- 
phisms and therefore at most that many inequivalcnt extensions. Consequently, 
dim Fp H 2 (G,M) < rdim Fp M. □ 

Proof of Theorem B': By the lemma, Holt's Conjecture follows from Corollary A' 
except for the case 2 G2(3 2e+1 ), which we now handle. 

Note that, if D is a subgroup of G that contains a Sylow p-subgroup of G, then 
the restriction map H 2 (G,M) — > H 2 (D,M) is injective [Gru, p. 91]. 

In order to use this, we consider the cases p — 2, p — 3 and p > 3 separately. 

If p > 3, a Sylow p-subgroup D of G is cyclic, and hence can be presented with 
one generator and one relation. Applying the preceding lemma to D yields the 
result for G with constant 1. 

A Sylow 2-subgroup of G is elementary abelian of order 8, and hence has a 
presentation with 3 generators and 6 relations. 

Finally, for p = 3, by Proposition 4.33 a Borel subgroup of G has a presentation 
with a bounded number of relations, whence the above lemma produces a bound 
in characteristic 3 as well. □ 

We will provide a different argument for this Ree case in [GKKL1]. 
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Working in the profinite category, Theorems B and B' are equivalent. Namely, 
in [Lub2] there is a formula for the minimal number r(G) of relations needed for a 
profinite presentation of a finite group G: 

( r dim H 2 (G,M)- dim H^CM)! , \ 

(7.2) r(G) = sup sup ( i > + d{G ) - C m) , 

where d(G) is the minimum number of generators for G, p runs over all primes, M 
runs over all irreducible F p [G]-modules, and £m = if M is the trivial module and 
1 if not. Note that dim H 1 (G,M) < d(G) dim M: every element of Z 1 (G,M) is a 
map G — > M that is completely determined on a generating set of G. Hence, (7.2) 
implies that 

(7.3) h(G) < f(G) < h(G) + d(G), 
where 

dimff 2 (G,M) 

WG := sup sup — — . 

V ; p M dimM 

Since d(G) = 2 for every finite simple group, (7.3) proves our assertion that Theo- 
rems B and B' are equivalent. 

8. Concluding remarks 

1. One of the purposes of [BGKLP] was to provide presentations for use in Com- 
putational Group Theory This has turned out to be essential, for example in 
[KS1, KS2]. When used in these references, those presentations led to efficient al- 
gorithms to test whether or not a given matrix group actually is isomorphic to a 
specific simple group. Short presentations are also essential in [LG, KS2] for glu- 
ing together presentations in a normal series (essentially a chief series) in order to 
obtain a presentation for a given matrix group. 

We expect that versions of many of the presentations in the present paper or 
[GKKL2] will have both practical and theoretical significance in Computational 
Group Theory. 

2. As mentioned in the Introduction, an elementary counting argument shows that 
our 0(log n + log q) bound in Theorem A is optimal in terms of n and q. However, 
in terms of n, p and e (where q = p e ) one might hope for presentations providing 
a slightly better bound: (3(logn + logp -I- loge). We have no idea whether such 
presentations exist even for PSL(2,p e ). 

3. A standard topological interpretation of Corollary A' states that all finite simple 
groups (except perhaps 2 G2(q)) are fundamental groups of 2-dimensional CW- 
complexes having a bounded number of cells. 

4. As suggested in the Introduction, it is not difficult to check that fewer than 1000 
relations are required in Theorem A. It would be interesting to have far better 
constants in all of the theorems. On the other hand, it would also be interesting to 
have presentations in Theorem A having only 2 generators, even if the number of 
relations grew somewhat (while remaining bounded); compare [GKKL2]. 

Earlier we mentioned Wilson's conjecture that the universal central extension of 
every finite simple group has a presentation with 2 generators and 2 relations [Wi] . 
In view of a standard property of the Schur multiplier M(G) [Sch], this would be 
optimal for a bounded presentation and would imply that every nonabclian finite 
simple group G has a presentation with 2 generators and 2 + d(M(G)) relations. 
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The only infinite family for which the conjecture is presently proven consists of the 
groups PSL(2,p) [CR2]. Wilson essentially proved the conjecture for Sz(q), q > 8, 
in the category of profinite presentations. 

5. Finally, in a similar vein, we note that there does not appear to be a known 
presentation of SL(2,p), when p is prime, having k generators and k + 1 relations 
for some k, and also having length O(logp). In Section 3.1 we already observed 
that the presentation of Sunday [Sun] is not this short. 

Acknowledgement: We are grateful to the referee for many helpful comments. We 
also thank Ravi Ramakrishna and Joel Rosenberg for assistance with Lemma 4.23, 
and James Wilson for checking our computations in Table B.l using GAP. 

Appendix A. Field lemma 
Proof of Lemma 4.23. Assume that ab ^ 0. Then (4.24) is equivalent to 
(A.l) a 2 b + b 2 a = ab, aa + bb=l. 

Taking norms of both sides of the first equation and inserting the second yields 
a 2 a 2 bb + a 3 b 3 + a 3 b 3 + aab 2 b 2 = aabb(aa + bb) 

= a 2 a 2 bb + aab 2 b 2 . 

Therefore, (ab) 3 E F q 6 with 6 q = -6 ^ 0. 

Consequently, ab — kdio with k G ¥ q , u> g F and u 3 = 1 = ujoj, where u = 1 if 
q ^ — 1 (mod 3). Substituting b = k9uj/a into (A.l) gives 

—a^kOu^ 1 1 a + a(k9oj/a) 2 = —akOu" 1 /a, aa — k 2 9 2 /(aa) = 1, 

which imply that 

k9 — a(a — a/a), k 2 9 2 — aa(l — aa). 

Then —k 2 9 2 = k9k9 = aa(a — a/a)(a — a/a), which simplifies to 

(A.2) 2(aa) 2 = a 3 + a 3 . 

Any solution of (A.2) yields (3,q + 1) solutions of (A.l) corresponding to the 
different possibilities for u. For example, using k := a(a — a /a)/ 9 and b := kOuu/a 
with uj 3 = 1, we find that k € ¥ q since 

k — —a(a — a/a)/9 = a(a — a/a)/9 = k 

by (A.2); and hence 

a 2 b + ab 2 — ab = b ■ ab/u; + ab 2 

= b[aoj ■ (a — a/a)u) + a(a — a/a)ui] 

= buj[2a 2 a 2 - a 3 - a 3 ]/aa = 0. 

If q is odd then F g 2 = ¥ q [9], so we can write a = x + y9 with x,y E¥ q and (A.2) 
becomes 

2(x 2 -y 2 6 2 ) 2 = 2(x 3 + 3xy 2 9 2 ). 

The left side is homogeneous of degree 4 and the right side is homogeneous of 
degree 3. This implies that this equation defines a curve is of genus 0, and a 
parametrization can be built using the fact that each line x = ty intersects the 
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curve at (0, 0) and at most one other point. Letting x = tf(t) and y = f(t) we 
obtain the equation 

2f(t) 4 {t 2 - 6 2 ) 2 = 2f(tft{t 2 + 3# 2 ), 

which has solutions 

/(*) = *§^rfp and /(*) = «■ 
(t 2 + w 2 ) . n . 

(with a related formula for 6) is a parametrization of the curve. 

The map t aa is a rational function over F 9 of degree 6, therefore aa does not 
generate ¥ q for at most X^fele fc<e ®P k va l ues of t. Moreover, a 3 (or, equivalently, 
a 2q ^ 1 ) is in F g for at most 5 choices t, because 

,3_ +3 (* 2 + 3^ 2 ) 3 



Therefore 



a" = t J 



-L^[ { t 2 + 3e 2 ) + e(M 2 + e 2 )}, 



and the equation t 3 {t 2 + 38 2 ) 3 (3t 2 + 9 2 ) = has at most 5 distinct solutions. Hence, 
the number of t such that a 3 ^ ¥ q and aa generates ¥ q is at least 

q-5-6 Y P fe > 9-5-6(e- l)g 1/2 . 

k\e,k<e 

Hence there exist at least 4 such a whenever q ^ 3, 5, 9, 25. Since aa = a 2q ^ 1 a 2q ^ 1 
generates ¥ q over ¥ p , it follows that a 29-1 generates F over F p . Direct verification 
shows that (A.l) has a solution with the desired properties for q = 9 and q = 25. 

If q is even then (A. 2) is a union of (3, q + 1) lines: a = a, together with a = toa 
and a — u>a if some w £ ¥ q satisfies ui 3 = 1 . If a is any generator of F* then the 
pair (a, 1 — a) is solution to (A.l) such that ¥ q — ¥ p [aa] — Fp[a 29_1 ]. □ 

Our original approach to Lemma 4.23 was very different. First we observed that 
if (3, q + 1) = 1 then (4.24) is equivalent to 

A + B = l, A q+1 + B q+1 = 1 

and hence, after substitution of B = 1 — A in the second equation, to 

A + A = 2AA, 

and this has q solutions in F, because it defines a curve of genus over F g (if we 
view A = x + yd with x, y € ¥ q ). An easy counting argument shows that unless q 
is even, there is always a solution which generates F over ¥ p . 

If 3|<7+1 then F = F g [w], where u 2 +uo+l = 0. This allows us to write a = x+yuj, 
b = u + vlu where x,y,u,v € ¥ q . Substituting in (4.24), we obtain a system of 3 
equations in 4 variables over F g , of degrees 3, 3 and 2. Since the coefficients in 
these equations turn out to be integers these equations define a curve C (or several 
curves) over Q. Using the bounds on the degrees of the equations one can obtain 
an upper bound for the genus g of the curve. It is known that the number of points 
of C over ¥ q is at least q—2g^/q, which for large enough q is more than the number 
of elements in ¥ q which do not generate this field. 

We started studying the above system in order to obtain a nice bound for the 
genus g and to find how large q has to be for the above argument to work. We 
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expected to get a curve of genus 9 or 10 but after doing computations we were 
surprised to find that actually we had 3 curves of genus 0, which allowed us to 
parametrize all solutions of the system. (N.B. If we change one of the constants on 
the right in (4.24) to a generic element in F p , we indeed obtain a curve of genus 9.) 

Appendix B. Suzuki triples 

Proof of Lemma 4.29. Every vector in Z 4 that cannot be reduced using (a) and 
(b) must have one of the following 25 types for some integers a, 6 > 0. (For, by 
Example 4.11, the first and third coordinates must be one of (—a, 0), (0, —a), (a, a), 
(a, a + 1), (a + 1, a), and similarly for the second and fourth.) 



Case 1 
(-a, -6,0,0) 



Case 2 
(-a, 0,0, -6) 



(0,0, -a, -6) (0,-6,-0,0) 



Case 3 
(a, —6, a, 0) 
(a, -6,o+ 1, 0) 
(a + 1,-6, o,0) 

(a, 0, a, —6) 
(a + l,0,o, -6) 
(a, 0, a + 1,-6) 



Case 4 
(0,6, -a, 6) 
(0,6+l,-o,6) 
(0,6,-a,6+l) 

(-a,6,0,6) 

(-a,6,0,6+l) 

(-a,6+l,0,6) 



Case 5 
(a, 6, a, 6) 
(a, 6, a + 1, 6) 
(a + 1, 6, a, 6) 
(a, 6, a, 6 + 1) 
(a, 6, a + 1,6 + 1) 
(a + 1, 6, o, 6 + 1) 
(o, 6 + 1, a, 6) 
(a, 6 + 1, o + 1, 6) 
(o + 1, 6 + 1, o, 6) 

Since Alsz is invariant under the permutation (1,3)(2,4) of coordinates, there 
are only 5 cases that need to be considered on this list: the ones on the second row 
above, in the last 3 columns decorated with some +l's as seen on the remainder 
of the last three columns. That is, we only need to consider the five cases v = 
(-a, -6, 0, 0), (-a, 0, 0, -6), (a + a, b + 0, a + 7, b + 5), (a + a, -b, a + 7, 0), (0, b + 
/?, —a, b + S) with a, b > and a, (3, 7, <5 € {0, 1}. We will present the second and 
third of these cases, leaving the remaining high school algebra to the reader. 

Case 2: Let v = (—a, 0,0, —b) with a,b > 0. Triple (c) has three translates 
containing v: 

1 (-a,0,0,-6) (l-a,-l,0,-6) 

2 (-1-0,1,0,-6) (-0,0,0,-6) 

3 (-0,0,-1,1-6) (l-o,-l,-l,l-6) 

Triple (d) has three translates containing v. 

4 (-0,0,0,-6) (-2-0,1,0,-6) 

5 (2-a,-l,0,-6) (-a,0,0,-6) 

6 (-0,0,2,-1-6) (-2-0,1,2,-1-6) 

Here v is (c)-reducible via 1 iff 
o 2 + 26 2 > (1 - a) 2 + 2 + 26 2 and 
o 2 + 26 2 > o 2 + 1 + 2(1 - 6) 2 , 
hence iff 2a > 2 and > 2 + 46, which never occurs. 

Also v is (c)-reducible via 2 iff 
o 2 + 26 2 > (-1 - o) 2 + 2 + 26 2 and 
o 2 + 26 2 > (-1 - o) 2 + 2 + 1 + 2(-l - 6) 2 , 
hence iff > 2a + 3 and > 2a + 46 + 6, which never occurs. 



(-a,0,l,-l-6) 
(-l-a,l,-l-6) 
(-a,0,0,-6) 

(-a, 0,-2, 1-6) 
(2 -a, -1,-2, 1-6) 
(-a, 0,0, -6) 



Also v is (c)-reducible via 3 iff 
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a 2 + 2b 2 > a 2 + 1 + 2(1 - b) 2 and 
a 2 + 2b 2 > (1 - a) 2 + 2 + 1 + 2(1 - 6) 2 , 
hence iff 46 > 3 and 2a + 46 > 6. 

Thus, the only way v can be (c) -irreducible is to have 46 < 3 or 2a + 46 < 4. 
Hence 6 < 1 . 

Next, x is (d)-reducible via 4 iff 
a 2 + 26 2 > (-2 - a) 2 + 2 + 26 2 and 
a 2 + 26 2 >a 2 + 4 + 2(l-6) 2 , 
hence iff > 4a + 6 and 46 > 6, which never occurs. 

Also v is (d)-reducible via 5 iff 
a 2 + 2b 2 > (2- a) 2 + 2 + 2b 2 and 
a 2 + 2b 2 > (2 - a) 2 + 2 + 4 + 2(1 - 6) 2 , 
hence iff 4a > 6 and 4a + 46 > 12. 

Also v is (d)-reducible via 6 iff 
a 2 + 2b 2 > a 2 + 4 + 2(-l - 6) 2 and 
a 2 + 2b 2 > (-2 - a) 2 + 2 + 4 + 2(-l - 6) 2 , 
hence iff > 46 + 6 and > 4a + 46 + 12, which never occurs. 

Thus, the only way v can be (d) -irreducible is to have 4a < 6 or 4a + 46 < 12. 
Hence a < 3. 

Consequently, all irreducible v of the specified sort have a < 3, 6 < 1 and hence 
M 2 <3 2 + 2. 

Case 3: The 9 types in the last column have the form v = (a + a, b + (3, a + 7, b + 5) 
with a, 6 > and a, /3, 7, S € {0, 1}. Triple (c) has three translates containing v. 

1 (a+a,6+,3,a+7,6+(5) (l+a+a,-l+b+ / 3,a+7,6+ ( 5) (o+a,6+/3,l+a+7, -1+6+5) 

2 (-l+a+a,l+6+^,a+7,6+<5) (a+a,b+ / 3,a+7,6+ ( 5) (-l+o+a,l+6+/3,l+a+7, -1+6+5) 

3 (a+a, b+/3,-l+a+7, 1+6+5) (l+a+a,-l+b+ / 3,-l+a+7, 1+6+5) (o+a,6+/3,a+7,6+5) 

Triple (d) has three translates containing u: 

4 (a+a, 6+/3,a+7, 6+5) (-2+a+a,l+b+,3,a+7,6+5) (o+a,6+/3,-2+a+7, 1+6+5) 

5 (2+a+a,-l+6+/3,a+7,6+5) (a+a, 6+,3,a+7, 6+5) (2+o+a, -1+6+/?, -2+o+7,l+6+5) 

6 (a+a, 6+/3,2+a+7, -1+6+5) (-2+a+a,l+b+,3,2+a+7, -1+6+5) (a+a,6+/3,a+7,b+5) 

Here v is (c)-reducible via 1 iff 

(a+a) 2 +2(6+/3) 2 + (a+7) 2 +2(6+5) 2 >(l+a+a) 2 +2(-l+6+,3) 2 + (a+7) 2 +2(6+5) 2 and 
(a+a) 2 +2(6+/3) 2 + (a+7) 2 +2(6+5) 2 >(a+a) 2 +2(6+,3) 2 + (l+a+7) 2 +2(-l+6+5) 2 , 

hence iff 4(6 + (3) > 2(a + a) + 3 and 4(6 + S) > 2(a + 7) + 3. 
Also v is (c)-reducible via 2 iff 

(a+a) 2 +2(6+/3) 2 + (a+ 7 ) 2 +2(6+5) 2 >(-l+a+a) 2 +2(l+6+^) 2 + (a+7) 2 +2(6+5) 2 and 
(a+a) 2 +2(6+/3) 2 + (a+7) 2 +2(6+5) 2 >(-l+a+a) 2 +2(l+6+,3) 2 + (l+a+7) 2 +2(-l+6+5) 2 , 

hence iff 2(a + a) > 4(6 + f3) + 3 and > 27 - 2a + 4/3 - 4(5 + 6, which never occurs. 
Also t; is (c)-reducible via 3 iff 

(a+a) 2 +2(6+/3) 2 + (a+ 7 ) 2 +2(6+5) 2 >(a+a) 2 +2(6+/3) 2 + (-l+a+7) 2 +2(l+6+5) 2 and 
(a+a) 2 +2(6+/3) 2 + (a+ 7 ) 2 +2(6+5) 2 >(l + a+a) 2 +2(-l+6+/3) 2 + (-l+a+7) 2 +2(l+6+5) 2 , 



PRESENTATIONS OF FINITE SIMPLE GROUPS 



55 



hence iff 2 (a + 7) > 4(6 + (3) + 3 and > 2a — 2-y + 45 — 4(3 + 6, which never occurs. 

Thus, the only way v can be (c) -irreducible is to have 4(6 + (3) < 2(a + a) + 3 or 
4(6 + S) < 2(a + 7) + 3, and hence to have 46 < 2a + 5. 

Next, v is (d)-reducible via 4 iff 

(a+a) 2 +2(6+/3) 2 + (a+ 7 ) 2 +2(6+5) 2 >(-2+a+Q) 2 +2(l+b+/3) 2 + (a+ 7 ) 2 +2(6+5) 2 and 
(a+a) 2 + 2(6+/3) 2 + (a+ 7 ) 2 +2(6+5) 2 >(a+Q) 2 +2(b+/3) 2 + (-2+a+ 7 ) 2 +2(l+6+5) 2 , 

hence iff 4(a + a) > 4(6 + /?) + 6 and 4(a + 7) > 4(6 + S) + 6. 
Also v is (d)-reducible via 5 iff 

(a+a) 2 +2(6+/3) 2 + (a+ 7 ) 2 +2(6+5) 2 >(2+a+Q) 2 +2(-l+b+^) 2 + (a+7) 2 +2(6+5) 2 and 
(a+Q) 2 +2(b+/3) 2 + (a+ 7 ) 2 +2(b+5) 2 >(2+a+a) 2 + 2(-l+6+^) 2 + (-2+a+7) 2 +2(l+b+ I 5) 2 , 

hence iff 4(6 + /?) > 4(a + a) + 6 and > 4a -4-y -4/3 + 4(5+ 12, which never occurs. 
Also v is (d)-reducible via 6 iff 

(a+Q) 2 +2(b+/3) 2 + (a+ 7 ) 2 +2(b+5) 2 >(a+a) 2 +2(b+/3) 2 + (2+a+7) 2 +2(-l+b+ I 5) 2 and 
(a+ Q ) 2 +2(b+/3) 2 + (a+ 7 ) 2 +2(b+5) 2 >(-2+a+a) 2 +2(l+b+/3) 2 + (2+a+7) 2 +2(-l+b+ I 5) 2 , 

hence iff 4(6 + 6) > 4(a + 7) + 6 and > -4a + 47 + 4/3 - 46 + 12, but the latter 
never occurs. 

Thus, the only way v can be (d) -irreducible is to have 4(a + a) < 4(6 + (3) + 6 
and 4(a + 7) < 4(6 + 5) + 6; and /ie«ce to have 4a < 46 + 10. 

Consequently, a^/ irreducible v of the specified sort have 46 < 2a + 5 and 4a < 
46 + 10, and hence satisfy < a < 7, < 6 < 4 and \v\ 2 < 2(8 2 + 2 • 5 2 ). 

We have now bounded the length of each irreducible vector. The bounds are in 
the following table. 

Case a < 6 < \v\ 2 < 

1 9 6 153 

2 3 1 11 

3 12 9 

4 1 2 26 

5 7 4 228 

Instead of considering all cases even more tediously than above, we used a computer 
calculation to obtain all irreducible vectors. □ 

For completeness we list all 139 irreducible vectors in Table B.l. 

A more conceptual way to prove Lemma 4.29 is following. The set Mu is the 
product of two copies of .Mbcrw- Therefore any irreducible vector for Mu lies 
close to (at distance at most 1) one of 9 quarter planes Ai (each quarter plane 
arises as a Cartesian product of 2 rays). These quarter planes are listed in the 
cases in the beginning of this appendix 

The system Ai-\> also can be transformed to the product of two copies of 
•Mbcrw- This implies that the irreducible vectors for M-v also are close to one 
of 9 quarter planes Bj. A relatively easy computation shows that the intersection 
of any two quarter planes Ai n Bj consists only of the origin, which implies that 
the set of irreducible vectors is bounded. 

In order to significantly decrease the size of the set of irreducible vectors we will 
enlarge Msz- This can be done using the following refinement of Section 4.2: 
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Table B.l. The irreducible vectors for Msz 



( — 9 — 6 0) 


( — 8 —5 0) 


( — 7 —5 0) 


( — 7 —4 0) 


(-Q —4 


0) 


( — 6 —3 0) 




( k q n n^ 
^ — o , — O , U , U _) 


( k o n n\ 

{ — o j — j « J 


( a ^ n c\\ 
\ — ^ j — o , u , u ) 


/ 4 2 


u , U ) 


^ ^ g gj 


( 3 3 


^ 3 2 0) 


^ 3 _1 g g^ 


^ 3 0) 


/ 2 2 


0) 


^ 2 1 0) 


^ 2 0) 


^ 2 10 0) 


^ ^ 2 0) 


^ ^ ^ g 


/ i 


i > 


^ 1 0) 


( — 1 1) 


( — 1 1 0) 


(0 —2 1 0) 


(0 — 1 —1 0) 


(0—10 


ni 
U J 


(0—110) 


(0 —9 —6) 


(0 o —8 —5) 


(g g _7 —5) 


(g g _7 —4) 


(0 —6 




(0,0,-6 —3) 


(0 —5 —4) 


(0 —5 —3) 


(0 —5,-2) 


(g g _4 —3) 


(0,0,-4 


—21 


(0,0 —4 —1) 


(0 q 3 —3) 


(0 —3 —2) 


(g g _3 —1) 


(0 —3 0) 


(0 —2 


9^1 


(0 —2 — 1) 


(g g 2 0) 


(g g 2 1) 


(g o ^ 2) 


(0 o 1 1) 


(Q Q ^ 


U J 


^g q ^ ]^ 


( r\ n n i 

^ u , u , u , — 1 ) 


^ U , U , U , U ) 


^ u , u , u , 1 ) 


en n i t\ 


(c\ n i n 

U , U , 1 , u 






(01—10) 


f0 1 0) 


(0 110) 


(0 111) 


(0211 




(1 — 1 0) 


(1,-1,1,0) 


(1,0,0,-2) 


(1,0,0,-1) 


(1,0,0,0) 


(1,0,0,1 




(1,0,1,-1) 


(1,0,1,0) 


(1,0,1,1) 


(1,0,2,-1) 


(1,0,2,0) 


(1,0,2,1 




(1,1,0,0) 


(1,1,0,1) 


(1,1,0,2) 


(1,1,1,0) 


(1,1,1,1) 


(1,1,1,2 




(1,1,2,0) 


(1,1,2,1) 


(1,1,2,2) 


(1,2,1,1) 


(1,2,2,1) 


(2,-1,1 


0) 


(2,0,1,0) 


(2,0,1,1) 


(2,0,2,1) 


(2,1,1,0) 


(2,1,1,1) 


(2,1,1,2 




(2,1,2,0) 


(2,1,2,1) 


(2,1,2,2) 


(2,1,3,0) 


(2,1,3,1) 


(2,1,3,2 




(2,2,1,1) 


(2,2,2,1) 


(2,2,3,1) 


(2,2,3,2) 


(2,3,3,2) 


(3,0,2,1 




(3,1,2,1) 


(3,1,2,2) 


(3,1,3,2) 


(3,2,2,1) 


(3,2,2,2) 


(3,2,2,3 




(3,2,3,1) 


(3,2,3,2) 


(3,2,3,3) 


(3,2,4,1) 


(3,2,4,2) 


(3,2,4,3 




(3,3,3,2) 


(3,3,4,2) 


(4,1,3,2) 


(4,2,3,2) 


(4,2,3,3) 


(4,2,4,3 




(4,3,3,2) 


(4,3,4,2) 


(4,3,5,2) 


(4,3,5,3) 


(4,4,5,3) 


(5,2,4,3 




(5,3,4,3) 


(5,3,4,4) 


(5,3,5,4) 


(5,4,5,3) 


(5,4,6,3) 


(6,3,5,4 




(6,5,7,4) 



Lemma B.l. Let M. be as in Definition 4.9. Let Mi, M2 € and i> S Z fe 6e suc/i 
tftaf Mi n (w + M 2 ) is a sm^Ze point Let M = (M 1 \ (u + M 2 )) U ((v + M 2 ) \ Mi) 
denote the symmetric difference of Mi and v + M 2 . Then any set SCZ 1 that is 
M-closed is also M U {M}-closed. 

Proof. Assume that S is .M-closed. For any u such that \S(~\ (u + M)\ > |M| — 1 we 
have |Sn(u + Mi)| > |Mi| - 1 or \Sf~\(u + v + M 2 )\ > |M 2 | - 1; we assume the first 
of these. Since S is Al-closed we have u + Mi C 5; in particular the intersection 
(u + Mi)n(u + u + M 2 ) is in 5". Since |(u + Mi)n(u + v + M 2 )| = \M 1 n(v + M 2 )\ = 1 
by hypothesis, it follows that \S D (u + v + M 2 )| > |M 2 | - 1. Since S is TW-closed, 
we see that u + v + M 2 C S and hence that u + M C S, which shows that S is also 
closed under {M}. □ 

This lemma is natural from a group-theoretic point of view. For, in our uses of 
Definition 4.9, for each M e A4 there is a relation of the form YimeMi 11 " 1 )^ = 
1. In the preceding lemma we have two such relations Yl me M 1 ( u,n ) ±1 = 1 = 
Yl mG M 2 (u v+m ) ±1 ■ By eliminating the one common term in these products we 
obtain a third relation IImeM( Mm ) ±1 = 1- 

Let M.12 be the system consisting of the 22 sets in the Table B.2. Each of the 
sets S5-S22 in that table is the symmetric difference of the two sets in the last two 
columns, and the preceding lemma can be applied. 
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Table B.2. The set M22 
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f{(\ n n c\\ ( 1 n n c\\ (r\ n i nU 
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So 
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Sk 
*-*5 


Iff) n n nWn n i nWn i n nWn i i -in 


»J1 




Sk 
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{(0,0, 0,0), (1,0, 0,0), (0,0, 0,1), (1,-1,0,1)} 


^1 




s 7 


{(1,0, 0,0), (0,0, 1,0), (2,-1, 0,0), (2,-1, -2,1)} 


Si 


(2,-l,0,0) + S 4 


Sa 
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(0 n 9 -in?, 


Sg 
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Si 


(1,0,2,-1) + S 4 


S10 


4Yn nnol M onn) (o — 1 i m ('9 — 1 — i iu 


^1 




Sn 


{(0,0, 0,0), (0,0, 0,1), (0,1, -1,1), (1,0, -1,1)} 


5 2 


(0,1,-1,1)+S 3 


Si 9 
'-'12 




c„ 
02 




S13 


{(0,0, 0,0), (0,0, 1,-1), (1,-1, 2,-1), (-1,0, 2,-1)} 


5 3 


(1,-1,2,-1) + S 4 


S14 


JYn n n d'i (~] — 1 n m (9 — 1 i — (9 — 1 — i nU 


03 


(O _1 1 _1 "\_|_ c. 


S15 


{(0,0, 0,0), (0,0, 0,1), (1,0, -2,1), (1,0, -1,1), (2, 0,0,0)} 


^2 


(1,0,-2, 1) + S 9 


Sir 
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Sl7 


{(0,0, 0,0), (0,1, 0,0), (1,-1, 0,0), (1,0, 0,0), (0,0, 1,0)} 




(1,-1,0,0)+Si2 


Sis 


( (1,0, 0,0), (0,0, 1,0), (0,0, 2,-1), "1 
I (-1,0, 2,-1), (0,0, 3, -2), (0,0, 1,-1) J 


s 8 


(-2,1,2,-1) + Si4 


Sig 


( (1,0,0,0), (0,0,1,0), (2, -1,0,0), "1 
I (2, -1,-1,0), (3, -2, 0,0), (1,-1, 0,0) J 


s 7 


(2,-l,-2,l)+Si 3 


S20 


( (0,0, 0,0), (0,0, 1,0), (1,-1, 0,0), "1 
I (-1,0, 1,0), (0,0, 1,0), (1,-1, 2,0) J 


Si 


(1,-1,0,0)+Si 6 


S2I 


f (0,0, 1,0), (1,0, 2,-1), (-1,1, 2,-1), "1 
I (0,0, 1,-1), (1,-1, 2,-1), (-1,0, 2,-1) J 


s 9 


S13 


S22 


j (1,0,0,0), (2, -1,1,0), (2, -1,-1,1), | 
I (1,-1,0,0), (2, -1,1,-1), (2, -1,-1,0) / 


S10 


S14 



Table B.3. The irreducible vectors for Al 2 2 



(-2,-1,0,0) 


(-1,-1,0,0) 


(-1,0,0,0) 


(0,-1,0,0) 


(0,-1,1,0) 


(0,0,-2,-1) 


(0,0,-1,-1) 


(0,0,-1,0) 


(0,0,-1,1) 


(0,0,0,-1) 


(0,0,0,0) 


(0,0,0,1) 


(0,0,1,0) 


(0,0,1,1) 


(0,1,0,0) 


(0,1,1,0) 


(1,0,0,-1) 


(1,0,0,0) 


(1,0,0,1) 


(1,0,1,0) 


(1,0,1,1) 


(1,1,0,0) 


(1,1,1,0) 


(1,1,1,1) 


(1,1,2,0) 


(1,1,2,1) 


(2,0,1,1) 


(2,1,1,1) 


(2,1,2,1) 


(2,2,3,1) 


(3,2,3,2) 


(-2,-2,0,0) 


(0,0,-2,-2) 







By Lemma B.l, any set which is closed under Msz is also closed under A^22- 
Consequently, the set of irreducible vectors for .M22 is a subset of the set of irre- 
ducible vectors for A^sz- A computer calculation gives that this system has only 
the 33 irreducible vectors listed in Table B.3 (this involves checking that 106 vectors 
in Table B.l reduce under the new tuples in A^)- 

By slightly perturbing the length function we will further shrink the set of irre- 
ducible vectors without introducing any new irreducible vectors. With respect to 
the length given by |(xi, xo % £3, X4)| 2 = LOlxf + 2x\ + 1.011x| + 2.002x| there are 
only 16 irreducible vectors for AI22, listed in Table B.4. (Even without leaving the 
original collection A^sz, a simpler calculation shows that this new length decreases 
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Table B.4. The set So of irreducible vectors for M22, with respect 
to the perturbed length 



(-1,-1,0,0) 


(-1,0,0,0) 


(0,-1,0,0) 


(0,0,-1,-1) 


(0,0,-1,0) 


(0,0,0,-1) 


(0,0,0,0) 


(0,1,0,0) 


(1,0,0,0) 


(1,0,1,0) 


(1,1,0,0) 


(1,1,1,0) 


(1,1,1,1) 


(2,1,1,1) 




(-2,-2,0,0) 


(-2,-1,0,0) 









the number of irreducible vectors from 139 to 69.) Therefore, using this length we 
have: 

Proposition B.2. The only Msz-closed subset ofZ which contains the set Sq of 
16 vectors in Table B.4 is Z 4 . 

Appendix C. Ree triples and quadruples 

We will sketch the ideas involved in the proof of Proposition 4.33. We use 
the notation for U and introduced before that proposition. Let V denote the 
subgroup x(0, *,*) of U, and let W be the subgroup x(0,Q, *). As in (4.28), if 
(i, j, k, I, to, n) e Z 6 and x G U we write 

(C.l) x (i,j,k,l,m,n) ._ x h with h = h\h 3 c0 hl +1 h l c0 + 1 hf +2 h2e +2 . 

The following trivial identities in F 

(C) + i = (C + i) (0 = 1 + (C + 2) 
(C + i) + i = (C + 2) (C e ) + i = (C e + i) 
(C e ) = 1 + (C e + 2) (C e + 1) + 1 = (C 6 + 2) 

become relations in U/V by (4.27): 

.,(o,o, 0,0, 0,0), ,(i,o, 0,0, 0,0) _ (0,0,1,0,0,0) ,,(1,0,0,0,0,0) _ ,,(0,0,0,0,0,0) (0,0,0,0,1,0) 

Lb Lb Lb Lb Lb Lb 

,,(0,0, 0,0, 0,0), ,(0,0, 1,0, 0,0) _ ,,(0,0,0,0,1,0) ,,(0,0, 0,0, 0,0), ,(0,1, 0,0, 0,0) _ ,,(0,0,0,1,0,0) 

Lb Lb Lb Lb Lb Lb 

,,(0,1,0,0,0,0) _ ,,(0,0, 0,0, 0,0), ,(0,0, 0,0, 0,1) ,,(0,0, 0,0, 0,0), ,(0,0, 0,1, 0,0) _ ,,(0,0,0,0,0,1) 

Lb Lb Lb Lb Lb Lh 

for any u e U. These relations correspond to the following collection Mu of triples 
fromZ 6 : 

{(0, 0, 0, 0, 0, 0), (1, 0, 0, 0, 0, 0), (0, 0, 1, 0, 0, 0)} {(0, 0, 0, 0, 0, 0), (1, 0, 0, 0, 0, 0), (0, 0, 0, 0, 1, 0)} 
{(0, 0, 0, 0, 0, 0), (0, 0, 1, 0, 0, 0), (0, 0, 0, 0, 1, 0)} {(0, 0, 0, 0, 0, 0), (0, 1, 0, 0, 0, 0), (0, 0, 0, 1, 0, 0)} 
{(0, 0, 0, 0, 0, 0), (0, 1, 0, 0, 0, 0), (0, 0, 0, 0, 0, 1)} {(0, 0, 0, 0, 0, 0), (0, 0, 0, 1, 0, 0), (0, 0, 0, 0, 0, 1)}. 

Since 9 2 = 3, we have 

(C 9 ) 6 ^ 1 (C e_1 ) 3 = c (e ~ 1)(e+3) = ( e2 + 20 - 3 = (( 9 ) 2 

The relations 1 + ( 2 + (( + l) 2 + (< + 2) 2 = and ( 2 (C + I) 2 + (C + 1) 2 (C + 2) + 
(C + 2) 2 C 2 = 1 imply 

1 + (o 9+1 (cY +1 + (c + i) e+ V + 1) 9+1 + (c + 2) e+ V + 2) 9+1 = 0, 
(O e+1 (cT +1 (c + i) e+ \c e + i) e+1 + (c + i) e+ \c e + i) e+1 (c + 2) e+ V + 2) 9+1 

+ (C + 2) d+ \c e + 2) 9+ \<;) e+1 (( 9 ) e+1 = 1 
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and two further identities obtained by applying 9, which by (4.32) imply relations 
mV/W: 

v (0, 0,0, 0,0, 0,0)^(1, 1,0, 0,0, 0)^(0,0, 1,1, 0,0)^(0,0, 0,0, 1,1) = i 
v (0, 0,0, 0,0, 0,0) = v (l, 1,1, 1,0, 0)^(0,0, 1,1, 1,1)^(1, 1,0, 0,1,1) 
v (0, 0,0, 0,0, 0,0)^(3, 1,0, 0,0, 0)^(0,0, 3, 1,0, 0)^(0,0,0, 0,3,1) = ^ 
(0,0,0,0,0,0,0) _ (3,1,3,1,0,0) (0,0,3,1,3,1) (3,1,0,0,3,1) 

for any v e V. These relations correspond to the following collection Mv of 
quadruples from Z 6 : 

{(0, 0, 0, 0, 0, 0), (1, 1, 0, 0, 0, 0), (0, 0, 1, 1, 0, 0), (0, 0, 0, 0, 1, 1)} 
{(0, 0, 0, 0, 0, 0), (1, 1, 1, 1, 0, 0), (0, 0, 1, 1, 1, 1), (1, 1, 0, 0, 1, 1)} 
{(0, 0, 0, 0, 0, 0), (3, 1, 0, 0, 0, 0), (0, 0, 3, 1, 0, 0), (0, 0, 0, 0, 3, 1)} 
{(0, 0, 0, 0, 0, 0), (3, 1, 3, 1, 0, 0), (0, 0, 3, 1, 3, 1), (3, 1, 0, 0, 3, 1)}. 

Similarly we have 

(c 9 ) 2 ~ e (c 2 ~ e ) 2 = c (2_e)(0+2) = c 4 ~ 02 = c 

and 

(£2)2-^)2-0 + 1 = (( C + l)Y- 9 (C 9 + I?- 9 

^2-9^2-0 = 1 + (( C + 2) 2 ) 2 - e (C 6 + 2f- 6 

((C + i) 2 ) 2 ~ V + if- 9 + 1 = ((C + i) 2 ) 2 ~V + if- 9 , 

together with additional similar identities; by (4.32) these imply the following re- 
lations in W: 

^(0,0,0,0,0,0,0)^(2,1,0,0,0,0) = ^(0,0,2,1,0,0) ^(2,1,0,0,0,0) = ^(0,0,0,0,0,0,0)^(0,0,0,0,2,1) 

^(0,0,0,0,0,0,0)^(0,0,2,1,0,0) _ ^(0,0,0,0,2,1) ^(0, 0,0, 0,0, 0,0)^(3, 2,0, 0,0,0) _ ^(0,0,3,2,0,0) 

w (3, 2, 0,0,0,0) _ w (0,0, 0,0, 0,0, 0)^(0,0,0,0, 3, 2) ^(0, 0,0, 0,0, 0,0)^(0, 0,3, 2,0,0) _ ^(0,0,0,0,3,2) 

for any w G W. These relations correspond to the following collection Mw of 
triples from Z 6 : 

{(0, 0, 0, 0, 0, 0), (2, 1, 0, 0, 0, 0), (0, 0, 2, 1, 0, 0)} {(0, 0, 0, 0, 0, 0), (2, 1, 0, 0, 0, 0), (0, 0, 0, 0, 2, 1)} 
{(0, 0, 0, 0, 0, 0), (0, 0, 2, 1, 0, 0), (0, 0, 0, 0, 2, 1)} {(0, 0, 0, 0, 0, 0), (3, 2, 0, 0, 0, 0), (0, 0, 3, 2, 0, 0)} 
{(0, 0, 0, 0, 0, 0), (3, 2, 0, 0, 0, 0), (0, 0, 0, 0, 3, 2)} {(0, 0, 0, 0, 0, 0), (0, 0, 3, 2, 0, 0), (0, 0, 0, 0, 3, 2)}. 

We will use the fact that all six systems 

Mu^M-u, Mu^M-v, Mu^M-w, Mv^M-v, Mv^M-w, Mw^M-w 
are of finite type with respect to suitable length functions in order to show that 
[U,U] = 1 (mod V), [U,V] = 1 (mod W), [U,W] = 1, [V, V] = 1 (mod W), 
[V,W] = 1 and [W, W] — 1, respectively. The first system contains several copies 
of -M±bcrw, therefore it is of finite type. The last system can be transformed 
to the first one by a linear transformation, therefore is also of finite type for the 
transformed length function. The next lemma gives the same result for the remain- 
ing four systems; we sketch a proof in Appendix C. As in Section 4.3.3, we can 
avoid using the first and last sets because we can use Lemma 4.1 to show that the 
group (vS h d) is abelian (mod V), and then impose additional relations to make it 
invariant under all h+. 

Lemma C. 2. Each of the systems Mr CCi := Mu^ M-v, Mb. CC2 := Mu^JM-w, 
A^Ree 3 := -Mv U A4-v and A4r o 4 := Mw U M.-v in Z 6 is of finite type using 
suitable length functions for the various systems. 
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Proof. As in the case of Lemma 4.29, again the proof is long and tedious. We use 
two 3-dimensional analogues A^3bcrw and A^bcrw of BCRW: AI3BCRW is 

{(0,0,0), (1,0,0), (0,1,0)} 
{(0,0,0), (1,0,0), (0,0,1)} 
{(0,0,0), (0,1,0), (0,0,1)}, 

consisting of 3 copies of A^bcrw; and A^bcrw' 1S 

{(0,0,0), (1,0,0), (0,1,0), (0,0,1)} 
{(0,0,0), (1,1,0), (0,1,1), (1,0,1)}. 

We use the Euclidean length defined by \(x, y, z)\ 2 = x 2 + y 2 + z 2 . 

Lemma C.3. (1) The irreducible vectors for A^bcrw are as follows (up to 
permutation of the coordinates): 

(—a, 0, 0), (a, a, a), (a, a, a + 1), (a, a, a — 1) 

for integers a > 0. 

(2) The irreducible vectors for .M3BCRW' are as follows (up to permutation of 
the coordinates): 

(—a, 0, 0), (—a, 0, —1), (—a, 0, 1), (a, a, a), (a, a, a + 1), (a, a, a — 1), (a, a + 1, a — 1) 
for integers a > 0. 

Proof. (1) .M3BCRW contains one copy of A^bcrw for each pair of coordinates, 
therefore the projection of an irreducible vector to any coordinate plane is an ir- 
reducible vector for A^bcrw, i.e., is one of (—a, 0), (a, a), (a, a + 1) for some non- 
negative integer a. It is easy to see that the vectors with this property are exactly 
the ones in the statement of the lemma. 

(2) Here we have to consider several different cases depending on the signs of the 
coordinates. We will do just one case and leave the remainder to the reader. 

Let (x,y,z) be an irreducible vector with x > y > z > 0. This vector reduces via 
(x — l,y, z), (x, y, z), (x — l,y+ 1, z), (x—l,y+l, z) unless x < y + 2. Similarly it 
reduces via (x—l,y—l,z), (x,y,z), (x— l,y, z + 1), (x,y— 1, z + 1) unless y < z + 2. 
Therefore the only irreducible vectors of this form are the ones with 

x = y = z or x = y = z + loix = y+ l = z + loTX = y+ l = z + 2. 

The other three cases x > y > > z: x > > y > z and > x > y > z are 
similar. □ 

Proof of Lemma C.2. As in the preceding argument, Mu can be transformed by 
a linear transformation to a product of two copies of A^sbcrw, and My can be 
transformed to A^bcrw ■ We choose the length function so that our transformation 
turns it into the Euclidean length function. 

In order to find all irreducible vectors for the system Aix U A4-y, where X, Y e 
{U, V,W}, we can proceed as follows: First we apply a linear transformation and 
pick a length function such that Mx becomes a product of two copies of A^3bcrw 
or A43BCRW with the Euclidean norm. Any vector which is irreducible for A4x 
lies close (a bounded distance from) one of 16 quarter planes A i} so we only have to 
check which of these vectors are irreducible for M-y ■ After doing the computations 
in each case one obtains a bound for the length of any irreducible vector, which 
proves that the system is of finite type. Unfortunately, the computations are very 
long and tedious and (as in Appendix B) do not provide any insight. 
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Equivalently one can argue that a vector which is irreducible for Ai-y lies close 
to one of 16 quarter planes Bj, and check that the intersection Ai (1 Bj is trivial in 
all cases. □ 

We are now ready to provide the presentation needed in Proposition 4.33. 

Generators: u, v, w, K for * e {(, ( + 1, C + 2, ( e , ( e + 1, ( e + 2}. 
Define x (id^l, m ,n) as in ( CA y 



Relations: 



(1 
(2 
(3 
(4 
(5 
(6 

(7 



(9 
(10 

(11 
(12 
(13 
(14 
(15 
(16 
(17 

(18 
(19 
(20 

(21 
(22 
(23 
(24 
(25 
(26 
(27 



[ft*, ft.] 

„3 



= 1 for*e {C,C + i,C + 2,C 9 ,C 8 

w" = 1. 

^(0,0,1,0,0,0) _ 1IW (i,o,o,o,o,o) i 

ww {0fi, 0,0,1,0) _ w (i,o,o,o,o,o)_ 



l,C e + 2}. 



[ W)W (1>0,0,0,0,0)] = 

[[w m<(I) ]]fct = 1. 

u 3 k + 1 

W V = \\ W X 1 



1. 



]/ lc , where x 3 



is reduced mod m^(x) in order to 



obtain a short relation. 

w (o,i,o,o 

^(0,1,0,0,0,0) 



^(0,0,0,1,0,0) = ww (o,i,o,o,o,o)_ 



(0,0,0,0,1,0) 
W\. 



WW 
V 3 

vv (i, 1,0,0,0,0)^(0,0,1, 1,0,0)^(0,0,0,0,1,1) = W2 

vv (3, 1,0, 0,0, 0)^(0,0,3,1, 0,0)^(0,0, 0,0, 3,1) _ Ws 

v (i, 1,1,1,0,0)^(0,0,1,1,1,1)^(1, 1,0,0,1,1) = vw ^ 

vw$. 



v (3, 1,3, 1,0,0)^(0,0, 3, 1,3, 1)^(3, 1,0,0, 3,1) 

[w Sl ,v] = 1, where Si is the set of irreducible vectors for Aiw U A4-y. 



r 



s 2 



V 



v] = wq, where 5*2 is the set of irreducible vectors for .My U./Vf_y. 

Wi. 



va^i (x) 

'• -[[v f - 



U 3 = V\. 

u (0,0,2,l,0,0) _ uu (2, 1,0,0,0,0) 



w ,for*e{C,C + i,C + 2,C e ,C fl + i,C e + 2}. 



t>2- 



lilt 



(0,0,0,0,2,1) 
^(0,0,3,2,0,0) = 
uu (0,0,0,0,3,2) 



= ^(2,1,0,0,0,0)^ 
^(3,2,0,0,0,0)^ 
= u (3,2,0,0,0,0)^ 



u, u 



(2,1,0,0,0,0)1 



S3 ,w] = 1, where S3 is the set of irreducible vectors for Mu^M-w. 



[u bi , v] — wg, where S 4 is the set of irreducible vectors for Mu^M-v- 

[[u m<(x) ]]h' ( h (t =VB- 



(28) u h * = l[u f * e - 2 ^ x) ]] h(2h(e v* for * e {C, C + 1, C + 2, ( d ,C 6 + 1, C* + 2}. 

Here, ioi,... and Ui,... are suitable elements of W := (i// a11 k*>) and V := 
(i/ a11 W), respectively. These depend on our initial choices (,u,v,w as well 
as on the length functions used to determine the sets Si. 

The proof that the above is a presentation for an infinite central extension of B 
is similar to the Suzuki case. We omit the details. 

There are 46 + \Si\ relations in this presentation, which would probably be 
somewhat unmanageable in practice. □ 
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